Yupan Huang | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Jiebo Luo
200 publications
Furu Wei
184 publications
Houqiang Li
144 publications
Li Dong
95 publications
Shuming Ma
66 publications
Jianlong Fu
55 publications
Shaohan Huang
46 publications
Bei Liu
29 publications
Yilong Yin
29 publications
Lei Cui
28 publications
Wenhui Wang
26 publications

research

∙ 09/20/2023

Kosmos-2.5: A Multimodal Literate Model

We present Kosmos-2.5, a multimodal literate model for machine reading o...

0 Tengchao Lv, et al. ∙

research

∙ 04/18/2022

LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Self-supervised pre-training techniques have achieved remarkable progres...

0 Yupan Huang, et al. ∙

research

∙ 10/19/2021

A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation

A creative image-and-text generative AI system mimics humans' extraordin...

0 Yupan Huang, et al. ∙

research

∙ 10/19/2021

Unifying Multimodal Transformer for Bi-directional Image and Text Generation

We study the joint learning of image-to-text and text-to-image generatio...

0 Yupan Huang, et al. ∙

research

∙ 06/25/2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training

Vision-Language Pre-training (VLP) aims to learn multi-modal representat...

0 Hongwei Xue, et al. ∙

research

∙ 04/07/2021

Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

We study joint learning of Convolutional Neural Network (CNN) and Transf...

0 Zhicheng Huang, et al. ∙

research

∙ 04/24/2020

Reinforcing Short-Length Hashing

Due to the compelling efficiency in retrieval and storage, similarity-pr...

0 Xingbo Liu, et al. ∙

research

∙ 04/16/2019

Decoupling Localization and Classification in Single Shot Temporal Action Detection

Video temporal action detection aims to temporally localize and recogniz...

0 Yupan Huang, et al. ∙

Success!

An error occurred