Language-Guided Traffic Simulation via Scene-Level Diffusion

06/10/2023
by Ziyuan Zhong, et al.

Realistic and controllable traffic simulation is a core capability needed to accelerate autonomous vehicle (AV) development. However, current approaches for controlling learning-based traffic models require significant domain expertise and are difficult for practitioners to use. To remedy this, we present CTG++, a scene-level conditional diffusion model that can be guided by language instructions. Developing it requires tackling two challenges: building a realistic and controllable traffic model backbone, and finding an effective way to interface with a traffic model through language. To address these challenges, we first propose a scene-level diffusion model equipped with a spatio-temporal transformer backbone, which generates realistic and controllable traffic. We then harness a large language model (LLM) to convert a user's query into a loss function, which guides the diffusion model towards query-compliant generation. Through comprehensive evaluation, we demonstrate the effectiveness of our proposed method in generating realistic, query-compliant traffic simulations.
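
The idea of steering a sampler with a loss function can be illustrated with a toy sketch. This is not the CTG++ implementation: `toy_denoise` is a hypothetical placeholder for the learned diffusion denoiser, `goal_loss` stands in for a loss that an LLM might produce from a query such as "vehicle should end at this point", and the step sizes are arbitrary assumptions. The sketch only shows the general pattern of nudging each denoised trajectory along the negative gradient of the loss.

```python
import random

def goal_loss(traj, target):
    # Hypothetical query-derived loss: squared distance between the
    # final waypoint of the trajectory and a target point.
    x, y = traj[-1]
    return (x - target[0]) ** 2 + (y - target[1]) ** 2

def goal_loss_grad(traj, target):
    # Analytic gradient of goal_loss; only the final waypoint contributes.
    grad = [(0.0, 0.0) for _ in traj]
    x, y = traj[-1]
    grad[-1] = (2.0 * (x - target[0]), 2.0 * (y - target[1]))
    return grad

def toy_denoise(traj, noise_scale, rng):
    # Placeholder for a learned denoiser (assumption): returns the input
    # trajectory with Gaussian noise of the given scale added.
    return [(x + rng.gauss(0.0, noise_scale), y + rng.gauss(0.0, noise_scale))
            for x, y in traj]

def guided_sample(traj, target, steps=50, guidance_scale=0.1, seed=0):
    rng = random.Random(seed)
    for k in range(steps):
        # Noise is annealed linearly and reaches zero at the last step.
        noise_scale = 0.05 * (1.0 - (k + 1) / steps)
        mean = toy_denoise(traj, noise_scale, rng)
        grad = goal_loss_grad(mean, target)
        # Guidance: shift the denoised output against the loss gradient.
        traj = [(x - guidance_scale * gx, y - guidance_scale * gy)
                for (x, y), (gx, gy) in zip(mean, grad)]
    return traj

if __name__ == "__main__":
    start = [(float(i), 0.0) for i in range(10)]  # straight path along x
    target = (9.0, 3.0)
    result = guided_sample(start, target)
    print("loss before:", goal_loss(start, target))
    print("loss after: ", goal_loss(result, target))
```

In this toy setting, swapping in a different loss function changes the behaviour of the sampler without retraining anything, which is the property that makes a language-to-loss interface attractive.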
