Predicting Human Intentions from Motion Only: A 2D+3D Fusion Approach

by   Andrea Zunino, et al.

In this paper, we address the new problem of the prediction of human intents. There is neuro-psychological evidence that actions performed by humans are anticipated by peculiar motor acts which are discriminant of the type of action going to be performed afterwards. In other words, an actual intent can be forecast by looking at the kinematics of the immediately preceding movement. To prove it in a computational and quantitative manner, we devise a new experimental setup where, without using contextual information, we predict human intents all originating from the same motor act. We posit the problem as a classification task and we introduce a new multi-modal dataset consisting of a set of motion capture marker 3D data and 2D video sequences, where, by only analysing very similar movements in both training and test phases, we are able to predict the underlying intent, i.e., the future, never observed action. We also present an extensive experimental evaluation as a baseline, customizing state-of-the-art techniques for either 3D and 2D data analysis. Realizing that video processing methods lead to inferior performance but show complementary information with respect to 3D data sequences, we developed a 2D+3D fusion analysis where we achieve better classification accuracies, attesting the superiority of the multimodal approach for the context-free prediction of human intents.


page 2

page 5

page 11


What Will I Do Next? The Intention from Motion Experiment

In computer vision, video-based approaches have been widely explored for...

Prediction of Manipulation Actions

Looking at a person's hands one often can tell what the person is going ...

Action Anticipation for Collaborative Environments: The Impact of Contextual Information and Uncertainty-Based Prediction

For effectively interacting with humans in collaborative environments, m...

ParkPredict+: Multimodal Intent and Motion Prediction for Vehicles in Parking Lots with CNN and Transformer

The problem of multimodal intent and trajectory prediction for human-dri...

Social-PatteRNN: Socially-Aware Trajectory Prediction Guided by Motion Patterns

As robots across domains start collaborating with humans in shared envir...

CAMREP- Concordia Action and Motion Repository

Action recognition, motion classification, gait analysis and synthesis a...

Sequential Decision Fusion for Environmental Classification in Assistive Walking

Powered prostheses are effective for helping amputees walk on level grou...

Please sign up or login with your details

Forgot password? Click here to reset