In this paper, we for the first time explore helpful multi-modal context...
Vision Language Models (VLMs), which extend Large Language Models (LLM) ...
Large Language Models (LLMs) are becoming increasingly smart and autonom...
ChatGPT, an AI chatbot, has gained popularity for its capability in
gene...
We present WebGLM, a web-enhanced question-answering system based on the...
We introduce MQ-Det, an efficient architecture and pre-training strategy...
Despite the recent emergence of video captioning models, how to generate...
There are many news articles reporting the obstacles confronting
poverty...
Reinforcement Learning (RL) algorithms can solve challenging control pro...
We introduce GLM-130B, a bilingual (English and Chinese) pre-trained lan...
This work proposes a new framework for a socially-aware dynamic local pl...
Facial affect analysis remains a challenging task with its setting
trans...
This paper develops a novel framework called Providers-Clients-Robots (P...
This paper presents PyTSK, a Python toolbox for developing Takagi-Sugeno...
In this paper we want to address the problem of automation for recogniti...
Machine learning has long been considered as a black box for predicting
...
We consider online resource allocation under a typical non-profit settin...
Matching markets involve heterogeneous agents (typically from two partie...
Vision transformers have recently received explosive popularity, but the...
In this paper, we detail the relationship between convolutions and
self-...
Big progress has been achieved in domain adaptation in decades. Existing...
In this paper, we present a regression-based pose recognition method usi...
In this paper, we present Co-scale conv-attentional image Transformers
(...
We consider a class of queries called durability prediction queries that...
Takagi-Sugeno-Kang (TSK) fuzzy system with Gaussian membership functions...
In this paper, we present a holistically end-to-end algorithm for line
s...
Several scientific studies have reported the existence of the income gap...
With the popularity of the Internet, traditional offline resource alloca...
Deep implicit field regression methods are effective for 3D reconstructi...
Data-driven AI promises support for pathologists to discover sparse tumo...
We study the combinatorial sleeping multi-armed semi-bandit problem with...
A brain-computer interface (BCI) enables a user to communicate with a
co...
A brain-computer interface (BCI) enables a user to communicate directly ...
We propose an algorithm, guided variational autoencoder (Guided-VAE), th...
Neural inductive program synthesis is a task generating instructions tha...
Exposure bias describes the phenomenon that a language model trained und...
Drowsy driving is pervasive, and also a major cause of traffic accidents...
Time-lapse is a technology used to record the development of embryos dur...
Large sequences of images (or movies) can now be obtained on an unpreced...
Task parallelism is designed to simplify the task of parallel programmin...
Deep neural networks have enjoyed remarkable success for various vision
...