Autoregressive decoding limits the efficiency of transformers for Machin...
We analyze how large language models (LLMs) represent out-of-context wor...
The use of relative representations for latent embeddings has shown pote...
Aligning the visual and language spaces requires to train deep neural
ne...
Neural networks embed the geometric structure of a data manifold lying i...
Graph Neural Networks (GNNs) have proven to be successful in several
pre...
Many modern deep-learning techniques do not work without enormous datase...