We introduce Three Towers (3T), a flexible method to improve the contras...
Scaling laws have been recently employed to derive compute-optimal model...
There has been a recent explosion of computer vision models which perfor...
We propose a simple pairwise sigmoid loss for image-text pre-training. U...
Misalignment between model predictions and intended usage can be detrime...
Vision Transformers convert images to sequences by slicing them into pat...
Effective scaling and a flexible task interface enable large language mo...
The remarkable progress in deep learning in recent years is largely driv...
We introduce UViM, a unified approach capable of modeling a wide range o...
It is commonly accepted that the Vision Transformer model requires
sophi...
This work presents a simple vision transformer design as a strong baseli...
This paper presents contrastive-tuning, a simple method employing contra...
Vision Transformers (ViT) have been shown to attain highly competitive
p...
Accurate estimation of predictive uncertainty (model calibration) is
ess...
There is a growing discrepancy in computer vision between large-scale mo...
Attention-based neural networks such as the Vision Transformer (ViT) hav...
Convolutional Neural Networks (CNNs) are the go-to model for computer vi...
Before deploying machine learning models it is critical to assess their
...
Meta and transfer learning are two successful families of approaches to
...
ML models often exhibit unexpectedly poor behavior when they are deploye...
While the Transformer architecture has become the de-facto standard for
...
Automatically finding good and general remote sensing representations al...
Modern deep convolutional networks (CNNs) are often criticized for not
g...
Yes, and no. We ask whether recent progress on the ImageNet classificati...
Transfer of pre-trained representations improves sample efficiency and
s...
Given the importance of remote sensing, surprisingly little attention ha...
This work tackles the problem of semi-supervised learning of image
class...
Deep generative models are becoming a cornerstone of modern machine lear...
Unsupervised visual representation learning remains a largely unsolved
p...
Conditional GANs are at the forefront of natural image synthesis. The ma...
GANs involve training two networks in an adversarial game, where each
ne...
Generative Adversarial Networks (GANs) are a class of deep generative mo...
We propose a new learning paradigm called Deep Memory. It has the potent...