From BERT to GPT-3 Codex: Harnessing the Potential of Very Large Language Models for Data Management

by   Immanuel Trummer, et al.

Large language models have recently advanced the state of the art on many natural language processing benchmarks. The newest generation of models can be applied to a variety of tasks with little to no specialized training. This technology creates various opportunities for applications in the context of data management. The tutorial will introduce participants to basic background on language models, discuss different methods to use language models, and give an overview and short demonstration of available libraries and APIs. Models for generating natural language will be considered as well as models, such as GPT-3 Codex, which complete program code or generate code from natural language instructions. Finally, the tutorial will discuss recent research in the database community that exploits language models in the context of traditional database systems or proposes novel system architectures that are based on them. The tutorial is targeted at database researchers. No prior background on language models is required. The goal of the tutorial is to introduce database researchers to the latest generation of language models, and to their use cases in the domain of data management.


page 1

page 2

page 3

page 4


Can Large Language Models design a Robot?

Large Language Models can lead researchers in the design of robots....

Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap

We present Text-to-OverpassQL, a task designed to facilitate a natural l...

Large Language Models Meet NL2Code: A Survey

The task of generating code from a natural language description, or NL2C...

Enhancing Network Management Using Code Generated by Large Language Models

Analyzing network topologies and communication graphs plays a crucial ro...

Harnessing Scalable Transactional Stream Processing for Managing Large Language Models [Vision]

Large Language Models (LLMs) have demonstrated extraordinary performance...

Geotechnical Parrot Tales (GPT): Harnessing Large Language Models in geotechnical engineering

The widespread adoption of large language models (LLMs), such as OpenAI'...

Context-based Ontology Modelling for Database: Enabling ChatGPT for Semantic Database Management

This research paper explores the use of ChatGPT in database management. ...

Please sign up or login with your details

Forgot password? Click here to reset