ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation

07/09/2020
by   Chuang Gan, et al.
10

We introduce ThreeDWorld (TDW), a platform for interactive multi-modal physical simulation. With TDW, users can simulate high-fidelity sensory data and physical interactions between mobile agents and objects in a wide variety of rich 3D environments. TDW has several unique properties: 1) realtime near photo-realistic image rendering quality; 2) a library of objects and environments with materials for high-quality rendering, and routines enabling user customization of the asset library; 3) generative procedures for efficiently building classes of new environments 4) high-fidelity audio rendering; 5) believable and realistic physical interactions for a wide variety of material types, including cloths, liquid, and deformable objects; 6) a range of "avatar" types that serve as embodiments of AI agents, with the option for user avatar customization; and 7) support for human interactions with VR devices. TDW also provides a rich API enabling multiple agents to interact within a simulation and return a range of sensor and physics data representing the state of the world. We present initial experiments enabled by the platform around emerging research directions in computer vision, machine learning, and cognitive science, including multi-modal physical scene understanding, multi-agent interactions, models that "learn like a child", and attention studies in humans and neural networks. The simulation platform will be made publicly available.

READ FULL TEXT

page 3

page 6

page 9

page 10

page 12

page 13

page 14

page 15

research
03/02/2023

Alexa Arena: A User-Centric Interactive Platform for Embodied AI

We introduce Alexa Arena, a user-centric simulation platform for Embodie...
research
09/07/2018

Unity: A General Platform for Intelligent Agents

Recent advances in Deep Reinforcement Learning and Robotics have been dr...
research
12/27/2022

Audiovisual Database with 360 Video and Higher-Order Ambisonics Audio for Perception, Cognition, Behavior, and QoE Evaluation Research

Research into multi-modal perception, human cognition, behavior, and att...
research
05/03/2021

VECA : A Toolkit for Building Virtual Environments to Train and Test Human-like Agents

Building human-like agent, which aims to learn and think like human inte...
research
07/07/2022

Finding Fallen Objects Via Asynchronous Audio-Visual Integration

The way an object looks and sounds provide complementary reflections of ...
research
11/25/2022

TPA-Net: Generate A Dataset for Text to Physics-based Animation

Recent breakthroughs in Vision-Language (V L) joint research have achi...
research
10/02/2018

Scientific image rendering for space scenes with the SurRender software

Spacecraft autonomy can be enhanced by vision-based navigation (VBN) tec...

Please sign up or login with your details

Forgot password? Click here to reset