Large Language Models (LLMs) have shown immense potential in multimodal
...
We introduce MAmmoTH, a series of open-source large language models (LLM...
Large-scale language models (LLMs), such as ChatGPT, are capable of
gene...
We introduce LyricWhiz, a robust, multilingual, and zero-shot automatic
...
Subject-driven image generation aims at generating images containing
cus...
In the era of extensive intersection between art and Artificial Intellig...
Text-guided image editing is widely needed in daily life, ranging from
p...
Self-supervised learning (SSL) has recently emerged as a promising parad...
This paper investigates the capabilities of Large Language Models (LLMs)...
Making image retrieval methods practical for real-world search applicati...
Interactive Natural Language Processing (iNLP) has emerged as a novel
pa...
The recent LLMs like GPT-4 and PaLM-2 have made tremendous progress in
s...
Question answering over knowledge bases is considered a difficult proble...
Recent text-to-image generation models like DreamBooth have made remarka...
Recently, there has been significant progress in teaching language model...
Conditioned diffusion models have demonstrated state-of-the-art text-to-...
Integrating free-text explanations to in-context learning of large langu...
Recent literature has shown that large language models (LLMs) are genera...
Building dialogue systems requires a large corpus of annotated dialogues...
While language Models store a massive amount of world knowledge implicit...
Research on text-to-image generation has witnessed significant progress ...
In this position paper, we propose a new approach to generating a type o...
A pressing challenge in current dialogue systems is to successfully conv...
Retrieval augmented language models have recently become the standard fo...
With a rise in false, inaccurate, and misleading information in propagan...
Task-adaptive pre-training (TAPT) and Self-training (ST) have emerged as...
The sheer volume of financial statements makes it difficult for humans t...
Time is an important dimension in our physical world. Lots of facts can
...
In comparison to the interpretation of classification models, the explan...
Although deep learning models have driven state-of-the-art performance o...
Knowledge bases (KBs) and text often contain complementary knowledge: KB...
Neural models for automated fact verification have achieved promising re...
Obtaining training data for Multi-hop Question Answering (QA) is extreme...
In open question answering (QA), the answer to a question is produced by...
One major task of spoken language understanding (SLU) in modern personal...
Data-to-text generation has recently attracted substantial interests due...
Previous works on Natural Language Generation (NLG) from structured data...
Neural natural language generation (NLG) models have recently shown
rema...
Existing question answering datasets focus on dealing with homogeneous
i...
We introduce a new task, Video-and-Language Inference, for joint multimo...
Large-scale knowledge graphs (KGs) are shown to become more important in...
There are two main lines of research on visual reasoning: neural module
...
The problem of verifying whether a textual hypothesis holds the truth ba...
Time series forecasting is an important problem across many domains,
inc...
Pre-trained embeddings such as word embeddings and sentence embeddings a...
Semantically controlled neural response generation on limited-domain has...
With the rapid development in deep learning, deep neural networks have b...
With the recent surge of interests in deep neural networks, more real-wo...
Sequence-to-Sequence models were introduced to tackle many real-life pro...
Task-oriented dialog systems are becoming pervasive, and many companies
...