Teng Wang

research

∙ 08/22/2023

Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models

Pre-trained vision-language models, e.g., CLIP, working with manually de...

0 Baoshuo Kan, et al. ∙

research

∙ 07/31/2023

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

Image-to-text generation aims to describe images using natural language....

0 Junjie Fei, et al. ∙

research

∙ 07/26/2023

Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models

Vision-language pre-training (VLP) models have shown vulnerability to ad...

0 Dong Lu, et al. ∙

research

∙ 06/17/2023

LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning

Our winning entry for the CVPR 2023 Generic Event Boundary Captioning (G...

0 Yunlong Tang, et al. ∙

research

∙ 04/27/2023

π-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation

Foundation models have achieved great advances in multi-task learning wi...

12 Chengyue Wu, et al. ∙

research

∙ 03/24/2023

Accelerating Vision-Language Pretraining with Free Language Modeling

The state of the arts in vision-language pretraining (VLP) achieves exem...

2 Teng Wang, et al. ∙

research

∙ 03/22/2023

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline

Existing audio-visual event localization (AVE) handles manually trimmed ...

0 Tiantian Geng, et al. ∙

research

∙ 03/11/2023

Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos

Joint video-language learning has received increasing attention in recen...

0 Teng Wang, et al. ∙

research

∙ 03/02/2023

LANDMARK: Language-guided Representation Enhancement Framework for Scene Graph Generation

Scene graph generation (SGG) is a sophisticated task that suffers from b...

0 Xiaoguang Chang, et al. ∙

research

∙ 09/25/2022

Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward

Advertisement video editing aims to automatically edit advertising video...

0 Yunlong Tang, et al. ∙

research

∙ 07/03/2022

Exploiting Context Information for Generic Event Boundary Captioning

Generic Event Boundary Captioning (GEBC) aims to generate three sentence...

0 Jinrui Zhang, et al. ∙

research

∙ 06/17/2022

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix

Existing vision-language pre-training (VLP) methods primarily rely on pa...

0 Teng Wang, et al. ∙

research

∙ 04/21/2022

Transformer-Guided Convolutional Neural Network for Cross-View Geolocalization

Ground-to-aerial geolocalization refers to localizing a ground-level que...

6 Teng Wang, et al. ∙

research

∙ 04/13/2022

Semantic-Aware Pretraining for Dense Video Captioning

This report describes the details of our approach for the event dense-ca...

6 Teng Wang, et al. ∙

research

∙ 03/17/2022

Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation

Scene graph generation is a sophisticated task because there is no speci...

0 Xiaoguang Chang, et al. ∙

research

∙ 08/17/2021

End-to-End Dense Video Captioning with Parallel Decoding

Dense video captioning aims to generate multiple associated captions wit...

0 Teng Wang, et al. ∙

research

∙ 06/23/2021

Transformer Meets Convolution: A Bilateral Awareness Net-work for Semantic Segmentation of Very Fine Resolution Ur-ban Scene Images

Semantic segmentation from very fine resolution (VFR) urban scene images...

0 Libo Wang, et al. ∙

research

∙ 06/04/2021

PoDT: A Secure Multi-chains Consensus Scheme Against Diverse Miners Behaviors Attacks in Blockchain Networks

As cross-chain technologies make the interactions among different blockc...

0 Jingyu Feng, et al. ∙

research

∙ 05/17/2021

Multi-modal Visual Place Recognition in Dynamics-Invariant Perception Space

Visual place recognition is one of the essential and challenging problem...

0 Lin Wu, et al. ∙

research

∙ 03/22/2021

ConfInLog: Leveraging Software Logs to Infer Configuration Constraints

Misconfigurations have become the dominant causes of software failures i...

0 Shulin Zhou, et al. ∙

research

∙ 10/11/2020

A Comprehensive Survey on Local Differential Privacy Toward Data Statistics and Analysis in Crowdsensing

Collecting and analyzing massive data generated from smart devices have ...

0 Teng Wang, et al. ∙

research

∙ 06/21/2020

Dense-Captioning Events in Videos: SYSU Submission to ActivityNet Challenge 2020

This technical report presents a brief description of our submission to ...

0 Teng Wang, et al. ∙

research

∙ 04/19/2020

Local Differential Privacy based Federated Learning for Internet of Things

Internet of Vehicles (IoV) is a promising branch of the Internet of Thin...

0 fcq, et al. ∙

research

∙ 11/27/2019

Reviewing and Improving the Gaussian Mechanism for Differential Privacy

Differential privacy provides a rigorous framework to quantify data priv...

8 Jun Zhao, et al. ∙

research

∙ 07/11/2019

Conditional Analysis for Key-Value Data with Local Differential Privacy

Local differential privacy (LDP) has been deemed as the de facto measure...

0 Lin Sun, et al. ∙

research

∙ 06/05/2019

Locally Differentially Private Data Collection and Analysis

Local differential privacy (LDP) can provide each user with strong priva...

0 Teng Wang, et al. ∙

research

∙ 06/04/2019

Privacy-preserving Crowd-guided AI Decision-making in Ethical Dilemmas

With the rapid development of artificial intelligence (AI), ethical issu...

6 Teng Wang, et al. ∙

research

∙ 12/25/2018

A Survey of FPGA Based Deep Learning Accelerators: Challenges and Opportunities

With the rapid development of in-depth learning, neural network and deep...

0 Teng Wang, et al. ∙

research

∙ 05/07/2015

Development of a Burst Buffer System for Data-Intensive Applications

Modern parallel filesystems such as Lustre are designed to provide high,...

0 Teng Wang, et al. ∙

Teng Wang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro