Trajectorylet-Net: a novel framework for pose prediction based on trajectorylet descriptors
Pose prediction is an increasingly interesting topic in computer vision and robotics. In this paper, we propose a new network, Trajectorylet-Net, to predict future poses. Compared with most existing methods, our model focuses on modeling the co-occurrence long-term information and spatiotemporal correlation. Specifically, a novel descriptor, trajectorylet, is introduced to characterize the static and dynamic information of the input pose sequence. Then, a coupled spatio-temporal learning schema is proposed to generate trajectorylet descriptors, which can simultaneously capture the local structure of the human body and the global co-occurrence temporal information of the input sequence. Finally, we propose to predict future poses by gathering trajectorylet descriptors gradually. Extensive experiments show that our method achieves state-of-the-art performance on two benchmarks (e.g. G3D and FNTU), which demonstrates the effectiveness of our proposed method.
READ FULL TEXT