In this work, we provide a recipe for training machine translation model...
Large decoder-only language models (LMs) can be largely improved in term...
General translation models often still struggle to generate accurate
tra...
This paper provides an overview of NVIDIA NeMo's neural machine translat...
In the English speech-to-text (STT) machine learning task, acoustic mode...
NeMo (Neural Modules) is a Python framework-agnostic toolkit for creatin...
We propose NovoGrad, a first-order stochastic gradient method with layer...
In this paper, we report state-of-the-art results on LibriSpeech among
e...
We present OpenSeq2Seq -- an open-source toolkit for training
sequence-t...
Deep neural networks have enabled progress in a wide variety of applicat...
This paper proposes a novel model for the rating prediction task in
reco...
We present two simple ways of reducing the number of parameters and
acce...