CHORUS: Foundation Models for Unified Data Discovery and Exploration

06/16/2023
by Moe Kayali, et al.

We explore the application of foundation models to data discovery and exploration tasks. Foundation models are large language models (LLMs) that show promising performance on a range of diverse tasks unrelated to their training. We show that these models are highly applicable to the data discovery and data exploration domain. Used carefully, they achieve superior performance on three representative tasks: table-class detection, column-type annotation, and join-column prediction. On all three tasks, we show that a foundation-model-based approach outperforms task-specific models and therefore the state of the art. Further, our approach often surpasses human-expert performance. This suggests a future direction in which disparate data management tasks can be unified under foundation models.


