Joint source and channel coding (JSCC) has attracted increasing attentio...
Posts, as important containers of user-generated-content pieces on socia...
In a lumped hydrological model structure, the hydrological function of a...
Current privacy-aware joint source-channel coding (JSCC) works aim at
av...
In this report, we introduce NICE
project[<https://nice.lgresearch.ai/>]...
In e-commerce search, relevance between query and documents is an essent...
The unmanned aerial vehicle (UAV) network is popular these years due to ...
When robots retrieve specific objects from cluttered scenes, such as hom...
Most existing image-text matching methods adopt triplet loss as the
opti...
Most existing cross-modal retrieval methods employ two-stream encoders w...
Energy theft detection (ETD) and energy consumption forecasting (ECF) ar...
In the absence of an authoritative statement about a rumor, people may e...
Network embedding, a graph representation learning method illustrating
n...
Currently, the reduction in the parameter scale of large-scale pre-train...
Multi-dimensional classification (MDC) can be employed in a range of
app...
With the development of the multi-media internet, visual characteristics...
The increasing installation rate of wind power poses great challenges to...
Car detection, particularly through camera vision, has become a major fo...
Hit rate is a key performance metric in predicting process product quali...
Aiding humans with scientific designs is one of the most exciting of
art...
Currently, video behavior recognition is one of the most foundational ta...
A key puzzle in search, ads, and recommendation is that the ranking mode...
Video Moment Retrieval (VMR) aims at retrieving the most relevant events...
Finetuning pretrained language models (LMs) have enabled appealing
perfo...
Pretrained language models (LMs) have shown compelling performance on va...
As the size of transistors approaches the mesoscopic scale, existing ene...
Generative AI (AIGC, a.k.a. AI generated content) has made remarkable
pr...
We present an interactive visual framework named InternGPT, or iGPT for
...
We consider a real-world scenario in which a newly-established pilot pro...
Detecting Resident Space Objects (RSOs) and preventing collisions with o...
The capability of Large Language Models (LLMs) like ChatGPT to comprehen...
The combination of LiDAR and camera modalities is proven to be necessary...
Current vision-language retrieval aims to perform cross-modal instance
s...
Low-light image enhancement (LLIE) investigates how to improve illuminat...
Smartphone cameras today are increasingly approaching the versatility an...
Recently, graph pre-training has attracted wide research attention, whic...
Most well-established and widely used color difference (CD) metrics are
...
Aiming to link natural language descriptions to specific regions in a 3D...
As ChatGPT goes viral, generative AI (AIGC, a.k.a AI-generated content) ...
Deep learning-based computer vision technology has grown stronger in rec...
Object tracking is divided into single-object tracking (SOT) and multi-o...
High quality speech capture has been widely studied for both voice
commu...
With the development of blockchain technology, more and more attention h...
Weakly supervised person search aims to jointly detect and match persons...
Soft robotics has opened a unique path to flexibility and environmental
...
Probabilistic circuits (PCs) are a prominent representation of probabili...
Multispectral pedestrian detection is a technology designed to detect an...
It has been a great challenge to develop robots that are able to perform...
In this paper, we construct a novel Eulerian-Lagrangian finite volume (E...
With the rapid development of classical and quantum machine learning, a ...