research
          
      
      ∙
      07/22/2022
    Zero-Shot Video Captioning with Evolving Pseudo-Tokens
We introduce a zero-shot video captioning method that employs two frozen...
          
            research
          
      
      ∙
      03/30/2022
    End to End Lip Synchronization with a Temporal AutoEncoder
We study the problem of syncing the lip movement in a video with the aud...
          
            research
          
      
      ∙
      11/29/2021
    Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Recent text-to-image matching models apply contrastive learning to large...
          
            research
          
      
      ∙
      11/13/2020