Conformer, combining convolution and self-attention sequentially to capt...
The Transformer architecture has been well adopted as a dominant archite...
This paper is a study of performance-efficiency trade-offs in pre-traine...
In this paper, we explore the use of pre-trained language models to lear...
This paper presents multistream CNN, a novel neural network architecture...
In this paper we present state-of-the-art (SOTA) performance on the
Libr...
The freedom of fast iterations of distributed deep learning tasks is cru...
In automated machine learning systems, concept drift in input data is on...
A unique challenge for e-commerce recommendation is that customers are o...