Skip to content

Latest commit

 

History

History
16 lines (11 loc) · 900 Bytes

README.md

File metadata and controls

16 lines (11 loc) · 900 Bytes

Motivation

Usage

To reproduce the two cases, please run the following codes.

$ cd ./DyingReLU
# or use `cd ./OscillatingLoss`
$ python train.py
  1. The test case in 'DyingReLU' direction shows that training problem occurrence is highly random. In our previous test, DyingReLU problem occurs in 50% of the cases, and 50% of the cases will behave normally and achieve high training accuracy.
  2. The test case in 'OscillatingLoss' direction shows that the time when a training problem occurs is random. In our previous test, Oscillating Loss problem occurs in the first 10 training epoch in 50% of the cases. In other cases, this problem happens in later epochs and 29% cases even don't perform this problem in the training.

avatar