Shao-Yen Tseng

research

∙ 05/31/2023

ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning

Two-Tower Vision-Language (VL) models have shown promising improvements ...

Xiao Xu, et al.

research

∙ 05/18/2023

LDM3D: Latent Diffusion Model for 3D

This research paper proposes a Latent Diffusion Model for 3D (LDM3D) tha...

Gabriela Ben Melech Stan, et al.

research

∙ 08/24/2022

Improving video retrieval using multilingual knowledge transfer

Video retrieval has seen tremendous progress with the development of vis...

Avinash Madasu, et al.

research

∙ 03/30/2022

VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers

Breakthroughs in transformer-based models have revolutionized not only t...

Estelle Aflalo, et al.

research

∙ 02/08/2022

CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations

Deriving multimodal representations of audio and lexical inputs is a cen...

Vin Sachidananda, et al.

research

∙ 09/22/2021

KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation

Self-supervised vision-and-language pretraining (VLP) aims to learn tran...

Yongfei Liu, et al.

research

∙ 09/10/2019

Multimodal Embeddings from Language Models

Word embeddings such as ELMo have recently been shown to model word sema...

Shao-Yen Tseng, et al.

research

∙ 08/31/2019

Behavior Gated Language Models

Most current language modeling techniques only exploit co-occurrence, se...

Prashanth Gurunath Shivakumar, et al.

research

∙ 08/02/2019

Predicting Behavior in Cancer-Afflicted Patient and Spouse Interactions using Speech and Language

Cancer impacts the quality of life of those diagnosed as well as their s...

Sandeep Nallan Chakravarthula, et al.

research

∙ 07/18/2018

Multi-Task Unsupervised Contextual Learning for Behavioral Annotation

Unsupervised learning has been an attractive method for easily deriving ...

Shao-Yen Tseng, et al.

research

∙ 12/27/2017

Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection

State-of-the-art audio event detection (AED) systems rely on supervised ...

Shao-Yen Tseng, et al.

research

∙ 12/27/2017

Multiple Instance Deep Learning for Weakly Supervised Audio Event Detection

State-of-the-art audio event detection (AED) systems rely on supervised ...

Shao-Yen Tseng, et al.

