Xiaodong Han

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Yu Qiao
242 publications
Meng Wang
217 publications
Dong Li
125 publications
Yuchao Dai
82 publications
Lingpeng Kong
59 publications
Nick Barnes
55 publications
Yiran Zhong
43 publications
Zhen Qin
34 publications
Xiao Luo
29 publications
Dongxu Li
22 publications
Weixuan Sun
14 publications

research

∙ 08/16/2023

Improving Audio-Visual Segmentation with Bidirectional Generation

The aim of audio-visual segmentation (AVS) is to precisely differentiate...

0 Dawei Hao, et al. ∙

research

∙ 07/27/2023

Scaling TransNormer to 175 Billion Parameters

We present TransNormerLLM, the first linear attention-based Large Langua...

0 Zhen Qin, et al. ∙

research

∙ 07/18/2023

Linearized Relative Positional Encoding

Relative positional encoding is widely used in vanilla and linear transf...

0 Zhen Qin, et al. ∙

research

∙ 05/08/2023

Toeplitz Neural Network for Sequence Modeling

Sequence modeling has important applications in natural language process...

0 Zhen Qin, et al. ∙

research

∙ 03/27/2023

Fine-grained Audible Video Description

We explore a new task for audio-visual-language modeling called fine-gra...

0 Xuyang Shen, et al. ∙

research

∙ 10/19/2022

The Devil in Linear Transformer

Linear transformers aim to reduce the quadratic space-time complexity of...

0 Zhen Qin, et al. ∙

research

∙ 10/15/2022

Linear Video Transformer with Feature Fixation

Vision Transformers have achieved impressive performance in video classi...

0 Kaiyue Lu, et al. ∙

Success!

An error occurred

Xiaodong Han

Featured Co-authors

Improving Audio-Visual Segmentation with Bidirectional Generation

Scaling TransNormer to 175 Billion Parameters

Linearized Relative Positional Encoding

Toeplitz Neural Network for Sequence Modeling

Fine-grained Audible Video Description

The Devil in Linear Transformer

Linear Video Transformer with Feature Fixation

Sign in with Google

Consider DeepAI Pro