Wengang Zhou

research

∙ 08/23/2023

Sign Language Translation with Iterative Prototype

This paper presents IP-SLT, a simple yet effective framework for sign la...

0 Huijie Yao, et al. ∙

research

∙ 08/19/2023

UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding

In the era of Large Language Models (LLMs), tremendous strides have been...

0 Hao Feng, et al. ∙

research

∙ 08/17/2023

SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning

In fisheye images, rich distinct distortion patterns are regularly distr...

0 Hao Feng, et al. ∙

research

∙ 08/17/2023

Text-Only Training for Visual Storytelling

Visual storytelling aims to generate a narrative based on a sequence of ...

0 Yuechen Wang, et al. ∙

research

∙ 08/14/2023

Masked Motion Predictors are Strong 3D Action Representation Learners

In 3D human action recognition, limited supervised data makes it challen...

0 Yunyao Mao, et al. ∙

research

∙ 08/11/2023

Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection

Recent progress in weakly supervised object detection is featured by a c...

0 Yufei Yin, et al. ∙

research

∙ 08/08/2023

Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video

Reconstructing interacting hands from monocular RGB data is a challengin...

0 Weichao Zhao, et al. ∙

research

∙ 07/17/2023

AltFreezing for More General Video Face Forgery Detection

Existing face forgery detection models try to discriminate fake images b...

0 Zhendong Wang, et al. ∙

research

∙ 06/09/2023

Exploring Effective Mask Sampling Modeling for Neural Image Compression

Image compression aims to reduce the information redundancy in images. M...

0 Lin Liu, et al. ∙

research

∙ 06/03/2023

MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning

Recent approaches have utilized self-supervised auxiliary tasks as repre...

0 Haolin Song, et al. ∙

research

∙ 05/26/2023

Detect Any Shadow: Segment Anything for Video Shadow Detection

Segment anything model (SAM) has achieved great success in the field of ...

0 Yonghui Wang, et al. ∙

research

∙ 05/16/2023

Hybrid and Collaborative Passage Reranking

In passage retrieval system, the initial passage retrieval results may b...

0 Zongmeng Zhang, et al. ∙

research

∙ 05/08/2023

SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding

Hand gesture serves as a crucial role during the expression of sign lang...

0 Hezhen Hu, et al. ∙

research

∙ 04/18/2023

Deep Unrestricted Document Image Rectification

In recent years, tremendous efforts have been made on document image rec...

0 Hao Feng, et al. ∙

research

∙ 04/12/2023

Learning Transferable Pedestrian Representation from Multimodal Information Supervision

Recent researches on unsupervised person re-identification (reID) have d...

0 Liping Bao, et al. ∙

research

∙ 03/24/2023

HandNeRF: Neural Radiance Fields for Animatable Interacting Hands

We propose a novel framework to reconstruct accurate appearance and geom...

0 Zhiyang Guo, et al. ∙

research

∙ 03/16/2023

DIRE for Diffusion-Generated Image Detection

Diffusion models have shown remarkable success in visual synthesis, but ...

0 Zhendong Wang, et al. ∙

research

∙ 03/16/2023

Focus on Your Target: A Dual Teacher-Student Framework for Domain-adaptive Semantic Segmentation

We study unsupervised domain adaptation (UDA) for semantic segmentation....

0 Xinyue Huo, et al. ∙

research

∙ 02/10/2023

BEST: BERT Pre-Training for Sign Language Recognition with Coupling Tokenization

In this work, we are dedicated to leveraging the BERT pre-training succe...

0 Weichao Zhao, et al. ∙

research

∙ 01/25/2023

Discriminative Experience Replay for Efficient Multi-agent Reinforcement Learning

In cooperative multi-agent tasks, parameter sharing among agents is a co...

0 Xunhan Hu, et al. ∙

research

∙ 01/21/2023

Recurrent Contour-based Instance Segmentation with Progressive Learning

Contour-based instance segmentation has been actively studied, thanks to...

0 Hao Feng, et al. ∙

research

∙ 11/28/2022

Hand-Object Interaction Image Generation

In this work, we are dedicated to a new task, i.e., hand-object interact...

0 Hezhen Hu, et al. ∙

research

∙ 11/28/2022

CLIP2GAN: Towards Bridging Text with the Latent Space of GANs

In this work, we are dedicated to text-guided image generation and propo...

0 Yixuan Wang, et al. ∙

research

∙ 11/22/2022

SinDiffusion: Learning a Diffusion Model from a Single Natural Image

We present SinDiffusion, leveraging denoising diffusion models to captur...

0 Weilun Wang, et al. ∙

research

∙ 10/31/2022

DanZero: Mastering GuanDan Game with Reinforcement Learning

Card game AI has always been a hot topic in the research of artificial i...

0 Yudong Lu, et al. ∙

research

∙ 10/21/2022

Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding

Temporal language grounding (TLG) aims to localize a video segment in an...

0 Yuechen Wang, et al. ∙

research

∙ 10/15/2022

UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior

Document images captured by mobile devices are usually degraded by uncon...

0 Yonghui Wang, et al. ∙

research

∙ 10/15/2022

Geometric Representation Learning for Document Image Rectification

In document image rectification, there exist rich geometric constraints ...

0 Hao Feng, et al. ∙

research

∙ 08/26/2022

CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation

In 3D action recognition, there exists rich complementary information be...

0 Yunyao Mao, et al. ∙

research

∙ 08/23/2022

Low-Light Video Enhancement with Synthetic Event Guidance

Low-light video enhancement (LLVE) is an important yet challenging task ...

1 Lin Liu, et al. ∙

research

∙ 07/14/2022

Unified 2D and 3D Pre-Training of Molecular Representations

Molecular representation learning has attracted much attention recently....

0 Jinhua Zhu, et al. ∙

research

∙ 06/30/2022

Semantic Image Synthesis via Diffusion Models

Denoising Diffusion Probabilistic Models (DDPMs) have achieved remarkabl...

6 Weilun Wang, et al. ∙

research

∙ 06/14/2022

TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer

In this work, we explore neat yet effective Transformer-based frameworks...

19 Jiajun Deng, et al. ∙

research

∙ 06/08/2022

Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer

The increased integration of renewable energy poses a slew of technical ...

0 Minrui Wang, et al. ∙

research

∙ 05/08/2022

Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods

Actor-critic Reinforcement Learning (RL) algorithms have achieved impres...

1 Qing Li, et al. ∙

research

∙ 05/07/2022

Multi-Target Active Object Tracking with Monte Carlo Tree Search and Target Motion Modeling

In this work, we are dedicated to multi-target active object tracking (A...

14 Zheng Chen, et al. ∙

research

∙ 05/05/2022

LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning

Cooperative multi-agent reinforcement learning (MARL) has made prominent...

2 Mingyu Yang, et al. ∙

research

∙ 04/06/2022

Domain-Agnostic Prior for Transfer Semantic Segmentation

Unsupervised domain adaptation (UDA) is an important topic in the comput...

0 Xinyue Huo, et al. ∙

research

∙ 04/06/2022

DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning

Recent years have witnessed the great breakthrough of deep reinforcement...

5 Youpeng Zhao, et al. ∙

research

∙ 03/21/2022

Learning Enriched Illuminants for Cross and Single Sensor Color Constancy

Color constancy aims to restore the constant colors of a scene under dif...

1 Xiaodong Cun, et al. ∙

research

∙ 03/16/2022

Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents

Multi-agent reinforcement learning is difficult to be applied in practic...

3 Jian Zhao, et al. ∙

research

∙ 03/16/2022

CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning

Due to the partial observability and communication constraints in many m...

6 Jian Zhao, et al. ∙

research

∙ 03/11/2022

TAPE: Task-Agnostic Prior Embedding for Image Restoration

Learning an generalized prior for natural image restoration is an import...

0 Lin Liu, et al. ∙

research

∙ 03/10/2022

MVP: Multimodality-guided Visual Pre-training

Recently, masked image modeling (MIM) has become a promising direction f...

0 Longhui Wei, et al. ∙

research

∙ 02/22/2022

Coordinate-Aligned Multi-Camera Collaboration for Active Multi-Object Tracking

Active Multi-Object Tracking (AMOT) is a task where cameras are controll...

8 Zeyu Fang, et al. ∙

research

∙ 02/21/2022

DQMIX: A Distributional Perspective on Multi-Agent Reinforcement Learning

In cooperative multi-agent tasks, a team of agents jointly interact with...

14 Jian Zhao, et al. ∙

research

∙ 02/09/2022

Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization

In cooperative multi-agent systems, agents jointly take actions and rece...

11 Jian Zhao, et al. ∙

research

∙ 02/03/2022

Direct Molecular Conformation Generation

Molecular conformation generation aims to generate three-dimensional coo...

9 Jinhua Zhu, et al. ∙

research

∙ 10/29/2021

Unsupervised Person Re-Identification with Wireless Positioning under Weak Scene Labeling

Existing unsupervised person re-identification methods only rely on visu...

0 Yiheng Liu, et al. ∙

research

∙ 10/28/2021

DocScanner: Robust Document Image Rectification with Progressive Learning

Compared to flatbed scanners, portable smartphones are much more conveni...

0 Hao Feng, et al. ∙

Wengang Zhou

Featured Co-authors

Sign in with Google

Consider DeepAI Pro