diff --git a/.gitignore b/.gitignore index 0ecdaa49ec94..1ff2a92cac64 100644 --- a/.gitignore +++ b/.gitignore @@ -94,6 +94,10 @@ target/ # Jupyter Notebook .ipynb_checkpoints +# Override Jupyter in Github Language states for more accurate estimate of repo code. +# Reference: https://github.com/github/linguist/blob/master/docs/overrides.md#generated-code +*.ipynb linguist-generated + # IPython profile_default/ ipython_config.py diff --git a/PUBLICATIONS.md b/PUBLICATIONS.md index 67c7cd11084b..365ed2773ed3 100644 --- a/PUBLICATIONS.md +++ b/PUBLICATIONS.md @@ -6,6 +6,21 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I # Automatic Speech Recognition (ASR) +
+ 2023 + + * [Fast Entropy-Based Methods of Word-Level Confidence Estimation for End-to-End Automatic Speech Recognition](https://ieeexplore.ieee.org/abstract/document/10022960) + * [Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition](https://ieeexplore.ieee.org/abstract/document/10023219) + +
+ +
+ 2022 + + * [Multi-blank Transducers for Speech Recognition](https://arxiv.org/abs/2211.03541) + +
+
2021 @@ -44,9 +59,9 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I ## Speaker Recognition (SpkR)
- 2021 + 2022 - * [TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context]( https://arxiv.org/pdf/2110.04410.pdf) + * [TitaNet: Neural Model for Speaker Representation with 1D Depth-Wise Separable Convolutions and Global Context](https://ieeexplore.ieee.org/abstract/document/9746806)
@@ -62,6 +77,15 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I ## Speech Classification +
+ 2022 + + * [AmberNet: A Compact End-to-End Model for Spoken Language Identification](https://arxiv.org/abs/2210.15781) + * [Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models](https://arxiv.org/abs/2211.05103) + + +
+
2021 @@ -78,12 +102,32 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I
+-------- + +## Speech Translation + +
+ 2022 + + * [NVIDIA NeMo Offline Speech Translation Systems for IWSLT 2022](https://aclanthology.org/2022.iwslt-1.18/) + +
+ + -------- # Natural Language Processing (NLP) ## Language Modeling +
+ 2022 + + * [Evaluating Parameter Efficient Learning for Generation](https://arxiv.org/abs/2210.13673) + * [Text Mining Drug/Chemical-Protein Interactions using an Ensemble of BERT and T5 Based Models](https://arxiv.org/abs/2111.15617) + +
+
2021 @@ -93,6 +137,13 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I ## Neural Machine Translation +
+ 2022 + + * [Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation](https://arxiv.org/abs/2206.01137) + +
+
2021 @@ -122,6 +173,13 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I # Text To Speech (TTS) +
+ 2022 + + * [Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers](https://arxiv.org/abs/2211.00585) + +
+
2021 @@ -140,6 +198,7 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I 2022 * [Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization](https://arxiv.org/abs/2203.15917) + * [Thutmose Tagger: Single-pass neural model for Inverse Text Normalization](https://arxiv.org/abs/2208.00064)