BatchNorm and Dropout #19

Does it make sense to put functional forms of BatchNorm and Dropout into NNlib so that other packages could simply import them from here?

Comments
Yeah, that's a great idea.
BatchNorm is going to be tricky because of its "statistics update step". The current thinking behind this with Zygote is to do something along the lines of https://gist.github.com/staticfloat/a509b1e1cb1fb556028779722c2531e6
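As an illustrative aside: one way to make the statistics update fit a functional API is to return the updated running statistics alongside the output instead of mutating them, so the layer wrapper decides what to store and the AD never sees a mutation. Below is a minimal sketch for 4-d (W, H, C, N) input; the `batchnorm` name, signature, and keywords are assumptions for illustration, not the gist's code or an existing NNlib API.

```julia
using Statistics

# Hypothetical functional batchnorm: returns the normalized output together
# with candidate new running statistics instead of writing them in place.
function batchnorm(x::AbstractArray{<:Real,4}, γ, β, μ, σ²;
                   ϵ = 1f-5, momentum = 0.1f0, training = true)
    reshp(v) = reshape(v, 1, 1, :, 1)              # broadcast along the channel dim
    if training
        μ_b  = mean(x; dims = (1, 2, 4))
        σ²_b = var(x; dims = (1, 2, 4), mean = μ_b, corrected = false)
        # new running statistics are returned to the caller, not mutated
        μ′  = (1 - momentum) .* μ  .+ momentum .* vec(μ_b)
        σ²′ = (1 - momentum) .* σ² .+ momentum .* vec(σ²_b)
    else
        μ_b, σ²_b = reshp(μ), reshp(σ²)
        μ′, σ²′   = μ, σ²
    end
    y = reshp(γ) .* (x .- μ_b) ./ sqrt.(σ²_b .+ ϵ) .+ reshp(β)
    return y, μ′, σ²′
end
```

A stateful layer can then commit μ′ and σ²′ back to its fields after the forward pass, keeping the mutation outside the differentiated code.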
Now that Flux's normalization interface has been reworked and GPU batchnorm has moved from CUDA.jl to NNlibCUDA, perhaps we should revisit this. The only reason https://github.com/FluxML/Flux.jl/tree/master/src/cuda exists at all now is to accommodate a non-standard implementation of batchnorm, so getting rid of that would be great.
I was looking into porting the functional form of the normalization layers here, but I'm not sure how to handle the …
The concern was raised earlier and is fixed by FluxML/Flux.jl#1509.
I don't think there's any reason these …
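Coming back to the question at the top of the thread, dropout is the easier half, since it carries no running state. Here is a hedged sketch of what a functional `dropout` living in NNlib could look like; the name, the RNG-first signature, and the `dims` keyword are assumptions for illustration, not a confirmed NNlib API.

```julia
using Random

# Hypothetical functional dropout: a pure function of the input, the drop
# probability `p`, and an explicit RNG; no layer state involved.
function dropout(rng::AbstractRNG, x::AbstractArray{<:AbstractFloat}, p::Real; dims = :)
    p == 0 && return x
    # The mask is full-sized along `dims` and singleton elsewhere, so it is
    # shared (broadcast) across the remaining dimensions.
    maskshape = dims === Colon() ? size(x) :
                ntuple(d -> d in dims ? size(x, d) : 1, ndims(x))
    # Inverted dropout: keep each entry with probability 1 - p and rescale so
    # the expected value matches evaluation mode (where dropout is the identity).
    mask = rand!(rng, similar(x, maskshape)) .> p
    return x .* mask ./ (1 - p)
end

dropout(x::AbstractArray{<:AbstractFloat}, p::Real; dims = :) =
    dropout(Random.default_rng(), x, p; dims)
```

A Flux `Dropout` layer would then reduce to a thin wrapper that calls this in training mode and is the identity otherwise.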