The development of correct and efficient software can be hindered by
com...
Predicting potential risks associated with the fatigue of key structural...
Dental caries is one of the most common oral diseases that, if left
untr...
In sequential recommendation, multi-modal information (e.g., text or ima...
Despite recent advances in semantic segmentation, an inevitable challeng...
Large Language Models (LLMs) have shown promise in multiple software
eng...
Visual bird's eye view (BEV) semantic segmentation helps autonomous vehi...
Siamese network has been a de facto benchmark framework for 3D LiDAR obj...
Large-scale well-annotated datasets are of great importance for training...
The conjugate gradient method is a crucial first-order optimization meth...
Class prototype construction and matching are core aspects of few-shot a...
Multi-hop QA involves finding multiple relevant passages and step-by-ste...
Large Language Models (LLMs) have revolutionized natural language proces...
Structured pruning and quantization are promising approaches for reducin...
Benefiting from prompt tuning, recent years have witnessed the promising...
Recent years have witnessed a few attempts of vision transformers for si...
Unsupervised representation learning approaches aim to learn discriminat...
Applying large-scale pre-trained visual models like CLIP to few-shot act...
Gradient clipping is a commonly used technique to stabilize the training...
Molecular property prediction has gained significant attention due to it...
In the realm of Tiny AI, we introduce "You Only Look at Interested Cells...
The field of cooperative multi-agent reinforcement learning (MARL) has s...
Neural network quantization is a very promising solution in the field of...
Semantic scene completion (SSC) jointly predicts the semantics and geome...
Reliable LiDAR panoptic segmentation (LPS), including both semantic and
...
Deep neural networks have been widely used in medical image analysis and...
Real-world time series is characterized by intrinsic non-stationarity th...
Protecting deep neural networks (DNNs) against intellectual property (IP...
Image harmonization aims to solve the visual inconsistency problem in
co...
Graph neural networks (GNNs) are the most widely adopted model in
graph-...
Multi-view stereo depth estimation based on cost volume usually works be...
Multimodal-driven talking face generation refers to animating a portrait...
Recently, emotional talking face generation has received considerable
at...
We propose a novel visual re-localization method based on direct matchin...
Most of the existing blind image Super-Resolution (SR) methods assume th...
Neural Architectures Search (NAS) becomes more and more popular over the...
Federated Magnetic Resonance Imaging (MRI) reconstruction enables multip...
Deep learning models have shown promising performance in the field of
di...
Recently, neural implicit surfaces have become popular for multi-view
re...
Immersive multimedia applications, such as Virtual, Augmented and Mixed
...
Medical phrase grounding (MPG) aims to locate the most relevant region i...
Place recognition is a challenging yet crucial task in robotics. Existin...
We propose efficient and parallel algorithms for the implementation of t...
Pairwise point cloud registration is a critical task for many applicatio...
One-shot image generation (OSG) with generative adversarial networks tha...
High-order Takagi-Sugeno-Kang (TSK) fuzzy classifiers possess powerful
c...
Real-world cooperation often requires intensive coordination among agent...
Differentiable Neural Architecture Search (DARTS) is becoming more and m...
Differentiable Architecture Search (DARTS) is a simple yet efficient Neu...
In this paper, we consider Discretized Neural Networks (DNNs) consisting...