Aggregated adamw update #16398
Conversation
@eric-haibin-lin FYI
LGTM. @eric-haibin-lin do you have any more comments?
The comment below is not specific to this PR:
It looks like a lot of code changes are needed to register operators for the multi_xx_update ops. What do you think would help reduce the development cycle for such ops? As we add more ops like this, the C++ code becomes less readable. Maybe TVM could help generate multi-tensor kernels?
Generally I would opt for cleaning the optimizers so that only the …
* Trigger CI
* MXNet operator for aggregated Adam update
* Fixing problem with getRescaleGrad(...) call in Python2 and some minor changes requested by Przemek
* Fix a problem appearing in Python2
* Minor cleanup
* Changing function name
* Trigger CI
* Eliminating "asnumpy()" conversion
* Trigger CI
Description
MXNet operator for aggregated Adam update.
Checklist
Essentials
Changes
- Tests, (and when applicable, API doc): the tests cover the `clip_gradient` parameter and random variations for `lr`, `eta`, `wd` and `shape`.