The number of end-to-end speech recognition models grows every year. The...
This paper presents a framework based on Weighted Finite-State Transduce...
The use of momentum in stochastic gradient methods has become a widespre...
We present OpenSeq2Seq -- an open-source toolkit for training sequence-t...
In this paper we explore different regression models based on Clusterwis...
The rise of deep learning in recent years has brought with it increasing...
Batch normalization (BN) has become a de facto standard for training dee...
A common way to speed up training of large convolutional networks is to ...