research
∙
07/22/2022
Zero-Shot Video Captioning with Evolving Pseudo-Tokens
We introduce a zero-shot video captioning method that employs two frozen...
research
∙
03/30/2022
End to End Lip Synchronization with a Temporal AutoEncoder
We study the problem of syncing the lip movement in a video with the aud...
research
∙
11/29/2021
Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Recent text-to-image matching models apply contrastive learning to large...
research
∙
11/13/2020