Previous Multimodal Information based Speech Processing (MISP) challenge...
This technical report details our submission system to the CHiME-7 DASR
...
In recent research, slight performance improvement is observed from auto...
Reasoning, a crucial aspect of NLP research, has not been adequately
add...
Accurate real-time traffic flow prediction can be leveraged to relieve
t...
Object detection is a critical component of various security-sensitive
a...
In Causal Discovery with latent variables, We define two data paradigms:...
The affective reasoning task is a set of emerging affect-based tasks in
...
The Multi-modal Information based Speech Processing (MISP) challenge aim...
Audio-visual approaches involving visual inputs have laid the foundation...
Speech pre-training has shown great success in learning useful and gener...
Self-supervised learning (SSL) models have achieved considerable improve...
Photonic neural networks are brain-inspired information processing techn...
In this paper, we propose a deep learning based multi-speaker direction ...
Increasing the layer number of on-chip photonic neural networks (PNNs) i...
Understanding causality helps to structure interventions to achieve spec...
Emotion-cause pair extraction (ECPE) is an emerging task aiming to extra...
Tremendous efforts have been made on instance segmentation but the mask
...
Federated learning (FL) is a promising distributed learning solution tha...
In this paper, we propose a novel deep learning architecture to improvin...
In this paper, we propose a visual embedding approach to improving embed...
Boolean functions with high algebraic immunity are important cryptograph...
Data analysts commonly utilize statistics to summarize large datasets. W...
To investigate whether and to what extent central serous chorioretinopat...
Data analysts commonly utilize statistics to summarize large datasets. W...