Tutorial -- Clarify on how we perform optimization #90

nicolay-r · 2021-03-18T05:46:42Z

AREkit/contrib/networks/context/architectures/base/base.py

Line 151 in ac07e88

    
           logits_unscaled, logits_unscaled_dropped = self.init_logits_unscaled(context_embedding)

NOTE:
This should be clarified and moved to another repository, the one related to benchmark results for RuSentRel-1.2.

nicolay-r · 2021-03-24T06:28:31Z

We may refer to this work:
https://arxiv.org/abs/2006.13730
which relies on this paper in terms of SGD application, bag terminology, and instance selection within bags:
https://www.aclweb.org/anthology/D15-1203.pdf

The latter already provides the correct description.
The slight problem with the paper is that it describes MAX over the labels rather than over bags.
So for sample gradients within bags we adopt the avg function, where the main assumption is that we take other synonymous attitudes into account.
We used this feature in earlier works (https://github.com/nicolay-r/sentiment-pcnn/tree/clls-2018).
Anyway, since we adopt BagSize = 1 in the latest research, we do not exploit this feature there.
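
A minimal sketch of one possible reading of the avg aggregation, assuming that averaging the sample gradients within a bag is equivalent to averaging the per-sample cross-entropy losses (gradients are linear in the loss). The names `softmax` and `bag_loss_avg` are hypothetical and are not taken from AREkit:

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def bag_loss_avg(bag_logits, label):
    """Mean cross-entropy over all samples of a single bag.

    bag_logits: one list of class logits per sample in the bag
                (assumption: every sample in the bag shares `label`).
    label: index of the bag label.
    """
    losses = [-math.log(softmax(logits)[label]) for logits in bag_logits]
    return sum(losses) / len(losses)


# With BagSize = 1 the average is just the loss of the single sample.
single = bag_loss_avg([[0.0, 0.0]], label=0)
```

With a bag of size 1 the averaging step has no effect, which matches the remark that the feature is not exploited when BagSize = 1.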

In the original approach (https://www.aclweb.org/anthology/D15-1203.pdf),
the authors select the best instance j within a bag, where "best" denotes the maximum value of p(y_i|m_i,j) across all instances in the bag. This way we obtain the loss function at the bag level and use the resulting value to update Theta with SGD (using AdaDelta).
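
The instance selection described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code of the cited paper or of AREkit: the names `label_probability`, `select_best_instance`, and `bag_loss_max` are hypothetical, and p(y_i|m_i,j) is assumed to be a softmax over per-instance class logits:

```python
import math


def label_probability(logits, label):
    """p(label | instance), computed as a numerically stable softmax."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    return exps[label] / sum(exps)


def select_best_instance(bag_logits, label):
    """Index j of the instance with the highest p(label | instance) in the bag."""
    probs = [label_probability(logits, label) for logits in bag_logits]
    return max(range(len(probs)), key=probs.__getitem__)


def bag_loss_max(bag_logits, label):
    """Bag-level loss: negative log-probability of the selected instance only."""
    j = select_best_instance(bag_logits, label)
    return -math.log(label_probability(bag_logits[j], label))


# Two instances in one bag, bag label 0: the second instance is selected.
bag = [[0.0, 2.0], [3.0, 0.0]]
best = select_best_instance(bag, label=0)
loss = bag_loss_max(bag, label=0)
```

Only the selected instance contributes to the bag loss, so only its gradient reaches Theta in the update step; the optimizer itself (AdaDelta in the paper) is left out of this sketch.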

nicolay-r added the documentation Improvements or additions to documentation label Mar 18, 2021

nicolay-r self-assigned this Mar 18, 2021

nicolay-r closed this as completed Aug 14, 2021
