Egocentric temporal action segmentation in videos is a crucial task in
c...
Cross-view action recognition (CVAR) seeks to recognize a human action w...
In this work, we look at score-based generative models (also called diff...
While score-based generative models, or diffusion models, have found suc...
There are limited works showing the efficacy of unsupervised
Out-of-Dist...
Human interpretation of the world encompasses the use of symbols to
cate...
Human pose estimation in video relies on local information by either
est...
Missing data poses significant challenges while learning representations...
Recent advances in Convolutional Neural Network (CNN) model interpretabi...
The ability to anticipate the future is essential when making real time
...
Bilinear pooling has recently been proposed as a feature encoding layer,...
In this paper, we propose a pipeline for multi-target visual tracking un...
Person re-identification (re-id) is a critical problem in video analytic...
Person re-identification is critical in surveillance applications. Curre...
This paper presents a new approach, based on polynomial optimization and...