Zejun Li

Featured Co-authors

Xuanjing Huang
123 publications
Jianqing Fan
83 publications
Zhongyu Wei
45 publications
Jingjing Chen
39 publications
Siyuan Wang
22 publications
Chun Yang
22 publications
Xu-Cheng Yin
21 publications
Lei Xiao
14 publications
Zhihao Fan
13 publications
Hongfa Wang
9 publications
Jiarong Xu
9 publications

research

∙ 06/11/2022

A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training

Multi-modal pre-training and knowledge discovery are two important resea...

Zhihao Fan, et al.

research

∙ 01/29/2022

MVPTR: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment

In this paper, we propose a Multi-stage Vision-language Pre-TRaining (MV...

Zejun Li, et al.

research

∙ 11/05/2021

Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval

Matching model is essential for Image-Text Retrieval framework. Existing...

Zhihao Fan, et al.

research

∙ 09/12/2021

Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval

Existing research for image text retrieval mainly relies on sentence-lev...

Zhihao Fan, et al.

research

∙ 06/21/2021

TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning

Existing research for image captioning usually represents an image using...

Zhihao Fan, et al.

research

∙ 03/21/2021

An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information

In this paper, we focus on the problem of unsupervised image-sentence ma...

Zejun Li, et al.

research

∙ 10/10/2017

AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition

Recognizing text in the wild is a really challenging task because of com...

Chun Yang, et al.
