Add Initial support for ContextNet Encoder and CTC Decoder #630

titu1994 · 2020-05-13T20:16:38Z

Changelog

Added

Add ContextNetEncoder, ContextNetDecoderForCTC neural modules to ASR collection
Add stride_last flag which allows stride and repeat flags to be used simultaneously. It will perform the strided convolution at the final Conv-BN-ReLU sub-block.
Add swish as optional activation function
Add zero_infinity flag to CTCLoss, default to False.
Adds integration test for ContextNetEncoder and ContextNetDecoderForCTC

Modified

Update Squeeze and Excitation sub-module to support different context sizes, support different activation
- Change default se_reduction_ratio to 8 instead of 16.
SpecAugment now supports either an integer or floating point value for time_width.
- If float is passed, adaptively uses it as percentage of current timesteps that should be cut.

Note: Currently, examples/asr/contextnet.py uses JasperDecoderForCTC instead of ContextNetDecoderForCTC. This will be updated in a future PR once full support is present.

Signed-off-by: smajumdar <[email protected]>

okuchaiev

few small comments

okuchaiev · 2020-05-13T21:43:41Z

nemo/collections/asr/contextnet.py

+logging = nemo.logging
+
+
+class ContextNetEncoder(TrainableNM):


Should this inherit from JasperEncoder ?

On second thought, it probably should not inherit JasperEncoder. While yes currently they share exactly same functionality, in the future they will not. In that case, the __init__ call will instantiate multiple JasperBlocks before ContextNetEncoder starts to instantiate its own values.

While there is duplication for now, it is cleaner to separate the two modules

nemo/collections/asr/parts/jasper.py

nemo/collections/asr/contextnet.py

nemo/collections/asr/losses.py

nemo/collections/asr/parts/jasper.py

Signed-off-by: smajumdar <[email protected]>

lgtm-com · 2020-05-14T21:45:49Z

This pull request introduces 1 alert when merging 8c81303 into a22d325 - view on LGTM.com

new alerts:

1 for Unused import

Signed-off-by: smajumdar <[email protected]>

blisc · 2020-05-14T21:51:16Z

examples/asr/contextnet.py

+
+    # (ContextNet uses the Jasper baseline encoder and decoder)
+    encoder = nemo_asr.ContextNetEncoder(
+        feat_in=contextnet_params["AudioToMelSpectrogramPreprocessor"]["features"],


Just a note that you can add this inside the yaml itself.
See https://confluence.atlassian.com/bitbucket/yaml-anchors-960154027.html

Thanks for the hint !

Signed-off-by: smajumdar <[email protected]>

lgtm-com · 2020-05-14T23:31:41Z

This pull request introduces 1 alert when merging 81330ba into a22d325 - view on LGTM.com

new alerts:

1 for Unused import

Signed-off-by: smajumdar <[email protected]>

* Add SE + context SE support Signed-off-by: smajumdar <[email protected]> * Add contextnet components Signed-off-by: smajumdar <[email protected]> * Add ContextNet support Signed-off-by: smajumdar <[email protected]> * Add config files Signed-off-by: smajumdar <[email protected]> * Correct configs Signed-off-by: smajumdar <[email protected]> * Add streaming speech command Signed-off-by: smajumdar <[email protected]> * Add kernel size factor argument Signed-off-by: smajumdar <[email protected]> * Add docstrings Signed-off-by: smajumdar <[email protected]> * Update CHANGELOG.md Signed-off-by: smajumdar <[email protected]> * Add integration tests Signed-off-by: smajumdar <[email protected]> * Style fixes and add docstrings for se_reduction_ratio Signed-off-by: smajumdar <[email protected]> * Style fixes in tests Signed-off-by: smajumdar <[email protected]> * Correct CHANGELOG.md Signed-off-by: smajumdar <[email protected]> * Correctios to docstrings Signed-off-by: smajumdar <[email protected]> * Add WandB support to contextnet.py Signed-off-by: smajumdar <[email protected]> * Style fixes Signed-off-by: smajumdar <[email protected]> * Remove unused import Signed-off-by: smajumdar <[email protected]> * Refactor ContextNetEncoder to subclass JasperEncoder Signed-off-by: smajumdar <[email protected]> * Remove unused imports Signed-off-by: smajumdar <[email protected]> Signed-off-by: ZeroCool <[email protected]>

Use a single jinja template for the prompts with and without a document. Also remove the conditionals checking for te presence of a document. Fixes NVIDIA#629 Signed-off-by: Derek Higgins <[email protected]>

titu1994 added 13 commits May 13, 2020 12:14

Add SE + context SE support

5f17859

Signed-off-by: smajumdar <[email protected]>

Add contextnet components

aa28939

Signed-off-by: smajumdar <[email protected]>

Add ContextNet support

86e3dbd

Signed-off-by: smajumdar <[email protected]>

Add config files

8a5c6de

Signed-off-by: smajumdar <[email protected]>

Correct configs

39c1a91

Signed-off-by: smajumdar <[email protected]>

Add streaming speech command

28a4cb1

Signed-off-by: smajumdar <[email protected]>

Add kernel size factor argument

646dd8f

Signed-off-by: smajumdar <[email protected]>

Add docstrings

d990c5c

Signed-off-by: smajumdar <[email protected]>

Update CHANGELOG.md

e14d5d5

Signed-off-by: smajumdar <[email protected]>

Add integration tests

d526488

Signed-off-by: smajumdar <[email protected]>

Style fixes and add docstrings for se_reduction_ratio

851350e

Signed-off-by: smajumdar <[email protected]>

Style fixes in tests

2e208f6

Signed-off-by: smajumdar <[email protected]>

Correct CHANGELOG.md

46cc5c7

Signed-off-by: smajumdar <[email protected]>

okuchaiev requested review from blisc and okuchaiev May 13, 2020 21:19

okuchaiev requested changes May 13, 2020

View reviewed changes

titu1994 added 3 commits May 13, 2020 15:35

Correctios to docstrings

a8d7f4c

Signed-off-by: smajumdar <[email protected]>

Add WandB support to contextnet.py

6d3e4ca

Signed-off-by: smajumdar <[email protected]>

Style fixes

8c81303

Signed-off-by: smajumdar <[email protected]>

Remove unused import

66924b6

Signed-off-by: smajumdar <[email protected]>

blisc previously approved these changes May 14, 2020

View reviewed changes

Refactor ContextNetEncoder to subclass JasperEncoder

81330ba

Signed-off-by: smajumdar <[email protected]>

titu1994 dismissed blisc’s stale review via 81330ba May 14, 2020 23:23

Remove unused imports

7ea9183

Signed-off-by: smajumdar <[email protected]>

okuchaiev approved these changes May 16, 2020

View reviewed changes

titu1994 merged commit 99ef493 into NVIDIA:master May 16, 2020

titu1994 deleted the se_context_support branch May 16, 2020 05:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Initial support for ContextNet Encoder and CTC Decoder #630

Add Initial support for ContextNet Encoder and CTC Decoder #630

titu1994 commented May 13, 2020 •

edited

Loading

okuchaiev left a comment

okuchaiev May 13, 2020

titu1994 May 13, 2020

titu1994 May 13, 2020 •

edited

Loading

titu1994 May 15, 2020

lgtm-com bot commented May 14, 2020

blisc May 14, 2020

titu1994 May 14, 2020

lgtm-com bot commented May 14, 2020

		logging = nemo.logging


		class ContextNetEncoder(TrainableNM):

Add Initial support for ContextNet Encoder and CTC Decoder #630

Add Initial support for ContextNet Encoder and CTC Decoder #630

Conversation

titu1994 commented May 13, 2020 • edited Loading

Changelog

Added

Modified

okuchaiev left a comment

Choose a reason for hiding this comment

okuchaiev May 13, 2020

Choose a reason for hiding this comment

titu1994 May 13, 2020

Choose a reason for hiding this comment

titu1994 May 13, 2020 • edited Loading

Choose a reason for hiding this comment

titu1994 May 15, 2020

Choose a reason for hiding this comment

lgtm-com bot commented May 14, 2020

blisc May 14, 2020

Choose a reason for hiding this comment

titu1994 May 14, 2020

Choose a reason for hiding this comment

lgtm-com bot commented May 14, 2020

titu1994 commented May 13, 2020 •

edited

Loading

titu1994 May 13, 2020 •

edited

Loading