We present Spatial LibriSpeech, a spatial audio dataset with over 650 ho...
The mechanisms behind the success of multi-view self-supervised learning...
Multiview Self-Supervised Learning (MSSL) is based on learning invarianc...
Lack of diversity in data collection has caused significant failures in
...
In this work, we observe that many existing self-supervised learning
alg...
Human skeleton point clouds are commonly used to automatically classify ...
As the use of deep learning in high impact domains becomes ubiquitous, i...
Image augmentations applied during training are crucial for the
generali...
We study the presence of expert units in pre-trained Transformer-based
L...
In this work we study the presence of expert units in pre-trained Transf...
Principal Filter Analysis (PFA), is an elegant, easy to implement, yet
e...