Layer compositing is one of the most popular image editing workflows amo...
Entity-aware image captioning aims to describe named entities and events...
This paper presents a new video question answering task on screencast
tu...
Weakly supervised object localization (WSOL) aims to locate objects in i...
Weakly Supervised Object Localization (WSOL) methodsusually rely on full...
Exploiting relationships among objects has achieved remarkable progress ...
Understanding web instructional videos is an essential branch of video
u...
Expectation maximization (EM) algorithm is to find maximum likelihood
so...