Yang Yang

research

∙ 09/19/2023

Disentangled Information Bottleneck guided Privacy-Protective JSCC for Image Transmission

Joint source and channel coding (JSCC) has attracted increasing attentio...

0 Lunan Sun, et al. ∙

research

∙ 09/18/2023

How to Generate Popular Post Headlines on Social Media?

Posts, as important containers of user-generated-content pieces on socia...

0 Zhouxiang Fang, et al. ∙

research

∙ 09/18/2023

Learning to Generate Lumped Hydrological Models

In a lumped hydrological model structure, the hydrological function of a...

0 Yang Yang, et al. ∙

research

∙ 09/15/2023

Privacy-Aware Joint Source-Channel Coding for image transmission based on Disentangled Information Bottleneck

Current privacy-aware joint source-channel coding (JSCC) works aim at av...

0 Lunan Sun, et al. ∙

research

∙ 09/05/2023

NICE 2023 Zero-shot Image Captioning Challenge

In this report, we introduce NICE project[<https://nice.lgresearch.ai/>]...

0 Taehoon Kim, et al. ∙

research

∙ 08/15/2023

SPM: Structured Pretraining and Matching Architectures for Relevance Modeling in Meituan Search

In e-commerce search, relevance between query and documents is an essent...

0 Wen Zan, et al. ∙

research

∙ 08/14/2023

Routing Recovery for UAV Networks with Deliberate Attacks: A Reinforcement Learning based Approach

The unmanned aerial vehicle (UAV) network is popular these years due to ...

0 Sijie He, et al. ∙

research

∙ 08/10/2023

IOSG: Image-driven Object Searching and Grasping

When robots retrieve specific objects from cluttered scenes, such as hom...

0 Houjian Yu, et al. ∙

research

∙ 08/08/2023

Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination

Most existing image-text matching methods adopt triplet loss as the opti...

0 Haoxuan Li, et al. ∙

research

∙ 08/08/2023

Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval

Most existing cross-modal retrieval methods employ two-stream encoders w...

0 Yi Bin, et al. ∙

research

∙ 07/30/2023

An Effective LSTM-DDPM Scheme for Energy Theft Detection and Forecasting in Smart Grid

Energy theft detection (ETD) and energy consumption forecasting (ECF) ar...

0 Xun Yuan, et al. ∙

research

∙ 06/18/2023

Focusing on Relevant Responses for Multi-modal Rumor Detection

In the absence of an authoritative statement about a rumor, people may e...

0 Jun Li, et al. ∙

research

∙ 06/15/2023

Accelerating Dynamic Network Embedding with Billions of Parameter Updates to Milliseconds

Network embedding, a graph representation learning method illustrating n...

0 Haoran Deng, et al. ∙

research

∙ 06/11/2023

GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

Currently, the reduction in the parameter scale of large-scale pre-train...

0 Shicheng Tan, et al. ∙

research

∙ 06/10/2023

Probabilistic Multi-Dimensional Classification

Multi-dimensional classification (MDC) can be employed in a range of app...

0 Vu-Linh Nguyen, et al. ∙

research

∙ 06/08/2023

COURIER: Contrastive User Intention Reconstruction for Large-Scale Pre-Train of Image Features

With the development of the multi-media internet, visual characteristics...

0 Jia-Qi Yang, et al. ∙

research

∙ 06/03/2023

A Novel Deep Knowledge-based Learning Method for Wind Speed Forecast

The increasing installation rate of wind power poses great challenges to...

0 Yang Yang, et al. ∙

research

∙ 05/31/2023

Cross-Domain Car Detection Model with Integrated Convolutional Block Attention Mechanism

Car detection, particularly through camera vision, has become a major fo...

0 Haoxuan Xu, et al. ∙

research

∙ 05/31/2023

A Novel Black Box Process Quality Optimization Approach based on Hit Rate

Hit rate is a key performance metric in predicting process product quali...

0 Yang Yang, et al. ∙

research

∙ 05/30/2023

IDToolkit: A Toolkit for Benchmarking and Developing Inverse Design Algorithms in Nanophotonics

Aiding humans with scientific designs is one of the most exciting of art...

0 Jia-Qi Yang, et al. ∙

research

∙ 05/25/2023

Deep Neural Networks in Video Human Action Recognition: A Review

Currently, video behavior recognition is one of the most foundational ta...

0 Zihan Wang, et al. ∙

research

∙ 05/24/2023

Breaking the Curse of Quality Saturation with User-Centric Ranking

A key puzzle in search, ads, and recommendation is that the ranking mode...

0 Zhuokai Zhao, et al. ∙

research

∙ 05/23/2023

Faster Video Moment Retrieval with Point-Level Supervision

Video Moment Retrieval (VMR) aims at retrieving the most relevant events...

0 Xun Jiang, et al. ∙

research

∙ 05/21/2023

Task-agnostic Distillation of Encoder-Decoder Language Models

Finetuning pretrained language models (LMs) have enabled appealing perfo...

0 Chen Zhang, et al. ∙

research

∙ 05/20/2023

Lifting the Curse of Capacity Gap in Distilling Language Models

Pretrained language models (LMs) have shown compelling performance on va...

0 Chen Zhang, et al. ∙

research

∙ 05/16/2023

Information Energy Ratio of XOR Logic Gate at Mesoscopic Scale

As the size of transistors approaches the mesoscopic scale, existing ene...

0 Xiaohu Ge, et al. ∙

research

∙ 05/10/2023

Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era

Generative AI (AIGC, a.k.a. AI generated content) has made remarkable pr...

0 Chenghao Li, et al. ∙

research

∙ 05/09/2023

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

We present an interactive visual framework named InternGPT, or iGPT for ...

0 Zhaoyang Liu, et al. ∙

research

∙ 05/07/2023

MrTF: Model Refinery for Transductive Federated Learning

We consider a real-world scenario in which a newly-established pilot pro...

0 Xin-Chun Li, et al. ∙

research

∙ 04/30/2023

A Simulation-Augmented Benchmarking Framework for Automatic RSO Streak Detection in Single-Frame Space Images

Detecting Resident Space Objects (RSOs) and preventing collisions with o...

0 Zhe Chen, et al. ∙

research

∙ 04/23/2023

Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

The capability of Large Language Models (LLMs) like ChatGPT to comprehen...

0 Bo Li, et al. ∙

research

∙ 04/19/2023

CrossFusion: Interleaving Cross-modal Complementation for Noise-resistant 3D Object Detection

The combination of LiDAR and camera modalities is proven to be necessary...

0 Yang Yang, et al. ∙

research

∙ 04/15/2023

CoVLR: Coordinating Cross-Modal Consistency and Intra-Modal Structure for Vision-Language Retrieval

Current vision-language retrieval aims to perform cross-modal instance s...

0 Yang Yang, et al. ∙

research

∙ 04/14/2023

Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement

Low-light image enhancement (LLIE) investigates how to improve illuminat...

1 Yuhui Wu, et al. ∙

research

∙ 04/06/2023

DC^2: Dual-Camera Defocus Control by Learning to Refocus

Smartphone cameras today are increasingly approaching the versatility an...

0 Hadi AlZayer, et al. ∙

research

∙ 03/29/2023

When to Pre-Train Graph Neural Networks? An Answer from Data Generation Perspective!

Recently, graph pre-training has attracted wide research attention, whic...

0 Yuxuan Cao, et al. ∙

research

∙ 03/27/2023

Learning a Deep Color Difference Metric for Photographic Images

Most well-established and widely used color difference (CD) metrics are ...

0 Haoyu Chen, et al. ∙

research

∙ 03/23/2023

ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding

Aiming to link natural language descriptions to specific regions in a 3D...

0 Ziyang Lu, et al. ∙

research

∙ 03/21/2023

A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?

As ChatGPT goes viral, generative AI (AIGC, a.k.a AI-generated content) ...

0 Chaoning Zhang, et al. ∙

research

∙ 03/20/2023

Learning Behavior Recognition in Smart Classroom with Multiple Students Based on YOLOv5

Deep learning-based computer vision technology has grown stronger in rec...

0 Zhifeng Wang, et al. ∙

research

∙ 03/16/2023

Rt-Track: Robust Tricks for Multi-Pedestrian Tracking

Object tracking is divided into single-object tracking (SOT) and multi-o...

0 Yukuan Zhang, et al. ∙

research

∙ 03/13/2023

Guided Speech Enhancement Network

High quality speech capture has been widely studied for both voice commu...

0 Yang Yang, et al. ∙

research

∙ 03/08/2023

Graph Neural Networks Enhanced Smart Contract Vulnerability Detection of Educational Blockchain

With the development of blockchain technology, more and more attention h...

0 Zhifeng Wang, et al. ∙

research

∙ 02/25/2023

Self-similarity Driven Scale-invariant Learning for Weakly Supervised Person Search

Weakly supervised person search aims to jointly detect and match persons...

0 Benzhi Wang, et al. ∙

research

∙ 02/24/2023

Bioinspired soft robotics: How do we learn from creatures?

Soft robotics has opened a unique path to flexibility and environmental ...

0 Yang Yang, et al. ∙

research

∙ 02/23/2023

Bayesian Structure Scores for Probabilistic Circuits

Probabilistic circuits (PCs) are a prominent representation of probabili...

0 Yang Yang, et al. ∙

research

∙ 02/17/2023

Cascaded information enhancement and cross-modal attention feature fusion for multispectral pedestrian detection

Multispectral pedestrian detection is a technology designed to detect an...

0 Yang Yang, et al. ∙

research

∙ 02/16/2023

Copebot: Underwater soft robot with copepod-like locomotion

It has been a great challenge to develop robots that are able to perform...

0 Zhiguo He, et al. ∙

research

∙ 02/14/2023

Stability analysis of the Eulerian-Lagrangian finite volume methods for nonlinear hyperbolic equations in one space dimension

In this paper, we construct a novel Eulerian-Lagrangian finite volume (E...

0 Yang Yang, et al. ∙

research

∙ 01/09/2023

VQNet 2.0: A New Generation Machine Learning Framework that Unifies Classical and Quantum

With the rapid development of classical and quantum machine learning, a ...

0 Huanyu Bian, et al. ∙

Yang Yang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro