Heng Wang

research

∙ 08/18/2023

V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models

Building artificial intelligence (AI) systems on top of a set of foundat...

0 Heng Wang, et al. ∙

research

∙ 07/27/2023

Exploring Annotation-free Image Captioning with Retrieval-augmented Pseudo Sentence Generation

Training an image captioner without annotated image-sentence pairs has g...

0 Zhiyuan Li, et al. ∙

research

∙ 05/17/2023

Can Language Models Solve Graph Problems in Natural Language?

Large language models (LLMs) are increasingly adopted for a variety of t...

0 Heng Wang, et al. ∙

research

∙ 04/22/2023

Detecting Spoilers in Movie Reviews with External Movie Knowledge and User Networks

Online movie review platforms are providing crowdsourced feedback for th...

0 Heng Wang, et al. ∙

research

∙ 04/08/2023

PVD-AL: Progressive Volume Distillation with Active Learning for Efficient Conversion Between Different NeRF Architectures

Neural Radiance Fields (NeRF) have been widely adopted as practical and ...

0 Shuangkang Fang, et al. ∙

research

∙ 03/25/2023

PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters

We propose PAniC-3D, a system to reconstruct stylized 3D character heads...

0 Shuhong Chen, et al. ∙

research

∙ 03/09/2023

Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision

Many top-down architectures for instance segmentation achieve significan...

3 Tarun Kalluri, et al. ∙

research

∙ 01/18/2023

Temporal Perceiving Video-Language Pre-training

Video-Language Pre-training models have recently significantly improved ...

0 Fan Ma, et al. ∙

research

∙ 11/29/2022

One is All: Bridging the Gap Between Neural Radiance Fields Architectures with Progressive Volume Distillation

Neural Radiance Fields (NeRF) methods have proved effective as compact, ...

0 Shuangkang Fang, et al. ∙

research

∙ 10/15/2022

PointNeuron: 3D Neuron Reconstruction via Geometry and Topology Learning of Point Clouds

Digital neuron reconstruction from 3D microscopy images is an essential ...

0 Runkai Zhao, et al. ∙

research

∙ 06/09/2022

TwiBot-22: Towards Graph-Based Twitter Bot Detection

Twitter bot detection has become an increasingly important task to comba...

0 Shangbin Feng, et al. ∙

research

∙ 06/01/2022

Towards Generalisable Audio Representations for Audio-Visual Navigation

In audio-visual navigation (AVN), an intelligent agent needs to navigate...

0 Shunqi Mao, et al. ∙

research

∙ 04/22/2022

Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds

Dense captioning in 3D point clouds is an emerging vision-and-language t...

12 Heng Wang, et al. ∙

research

∙ 04/12/2022

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity

Open-world instance segmentation is the task of grouping pixels into obj...

2 Weiyao Wang, et al. ∙

research

∙ 04/08/2022

Canonical Mean Filter for Almost Zero-Shot Multi-Task classification

The support set is a key to providing conditional prior for fast adaptio...

0 Yong Li, et al. ∙

research

∙ 11/18/2021

PyTorchVideo: A Deep Learning Library for Video Understanding

We introduce PyTorchVideo, an open-source deep-learning library that pro...

295 Haoqi Fan, et al. ∙

research

∙ 08/30/2021

Searching for Two-Stream Models in Multivariate Space for Video Recognition

Conventional video models rely on a single stream to capture the complex...

0 Xinyu Gong, et al. ∙

research

∙ 06/29/2021

Towards Understanding the Effectiveness of Attention Mechanism

Attention Mechanism is a widely used method for improving the performanc...

0 Xiang Ye, et al. ∙

research

∙ 04/10/2021

Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation

Current state-of-the-art object detection and segmentation methods work ...

0 Weiyao Wang, et al. ∙

research

∙ 04/02/2021

Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories

The standard way of training video models entails sampling at each itera...

0 Xitong Yang, et al. ∙

research

∙ 02/09/2021

Is Space-Time Attention All You Need for Video Understanding?

We present a convolution-free approach to video classification built exc...

0 Gedas Bertasius, et al. ∙

research

∙ 03/14/2020

From W-Net to CDGAN: Bi-temporal Change Detection via Deep Learning Techniques

Traditional change detection methods usually follow the image differenci...

0 Bin Hou, et al. ∙

research

∙ 12/19/2019

CJRC: A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension

We present a Chinese judicial reading comprehension (CJRC) dataset which...

0 Xingyi Duan, et al. ∙

research

∙ 12/14/2019

Region and Object based Panoptic Image Synthesis through Conditional GANs

Image-to-image translation is significant to many computer vision and ma...

0 Heng Wang, et al. ∙

research

∙ 11/20/2019

CAIL2019-SCM: A Dataset of Similar Case Matching in Legal Domain

In this paper, we introduce CAIL2019-SCM, Chinese AI and Law 2019 Simila...

0 Chaojun Xiao, et al. ∙

research

∙ 06/10/2019

FASTER Recurrent Networks for Video Classification

Video classification methods often divide the video into short clips, do...

0 Linchao Zhu, et al. ∙

research

∙ 06/07/2019

Video Modeling with Correlation Networks

Motion is a salient cue to recognize actions in video. Modern action rec...

0 Heng Wang, et al. ∙

research

∙ 05/02/2019

Large-scale weakly-supervised pre-training for video action recognition

Current fully-supervised video datasets consist of only a few hundred th...

0 Deepti Ghadiyaram, et al. ∙

research

∙ 04/04/2019

Video Classification with Channel-Separated Convolutional Networks

Group convolution has been shown to offer great computational savings in...

0 Du Tran, et al. ∙

research

∙ 04/03/2019

Multi-task Learning for Chinese Word Usage Errors Detection

Chinese word usage errors often occur in non-native Chinese learners' wr...

0 Jinbin Zhang, et al. ∙

research

∙ 04/03/2019

Defeats GAN: A Simpler Model Outperforms in Knowledge Representation Learning

The goal of knowledge representation learning is to embed entities and r...

0 Heng Wang, et al. ∙

research

∙ 10/13/2018

Overview of CAIL2018: Legal Judgment Prediction Competition

In this paper, we give an overview of the Legal Judgment Prediction (LJP...

0 Haoxi Zhong, et al. ∙

research

∙ 07/04/2018

CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction

In this paper, we introduce the Chinese AI and Law challenge dataset (CA...

0 Chaojun Xiao, et al. ∙

research

∙ 02/20/2018

Devon: Deformable Volume Network for Learning Optical Flow

We propose a lightweight neural network model, Deformable Volume Network...

0 Yao Lu, et al. ∙

research

∙ 12/26/2017

SLAC: A Sparsely Labeled Dataset for Action Classification and Localization

This paper describes a procedure for the creation of large-scale video d...

0 Hang Zhao, et al. ∙

research

∙ 12/01/2017

Text Generation Based on Generative Adversarial Nets with Latent Variable

In this paper, we propose a model using generative adversarial net (GAN)...

0 Heng Wang, et al. ∙

research

∙ 11/30/2017

A Closer Look at Spatiotemporal Convolutions for Action Recognition

In this paper we discuss several forms of spatiotemporal convolutions fo...

0 Du Tran, et al. ∙

research

∙ 07/25/2017

Concept Drift Detection and Adaptation with Hierarchical Hypothesis Testing

In a streaming environment, there is often a need for statistical predic...

0 Shujian Yu, et al. ∙

research

∙ 04/21/2015

A robust and efficient video representation for action recognition

This paper introduces a state-of-the-art video representation and applie...

0 Heng Wang, et al. ∙

research

∙ 04/04/2015

Concept Drift Detection for Streaming Data

Common statistical prediction models often require and assume stationari...

0 Heng Wang, et al. ∙

Heng Wang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro