JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition

by   Lucian Bicsi, et al.

We propose JEDI, a multi-dataset semi-supervised learning method, which efficiently combines knowledge from multiple experts, learned on different datasets, to train and improve the performance of individual, per dataset, student models. Our approach achieves this by addressing two important problems in current machine learning research: generalization across datasets and limitations of supervised training due to scarcity of labeled data. We start with an arbitrary number of experts, pretrained on their own specific dataset, which form the initial set of student models. The teachers are immediately derived by concatenating the feature representations from the penultimate layers of the students. We then train all models in a student-teacher semi-supervised learning scenario until convergence. In our efficient approach, student-teacher training is carried out jointly and end-to-end, showing that both students and teachers improve their generalization capacity during training. We validate our approach on four video action recognition datasets. By simultaneously considering all datasets within a unified semi-supervised setting, we demonstrate significant improvements over the initial experts.


page 1

page 2

page 3

page 4


Snowball: Iterative Model Evolution and Confident Sample Discovery for Semi-Supervised Learning on Very Small Labeled Datasets

In this work, we develop a joint sample discovery and iterative model ev...

Semi-supervised Learning with a Teacher-student Network for Generalized Attribute Prediction

This paper presents a study on semi-supervised learning to solve the vis...

Teacher-Student chain for efficient semi-supervised histology image classification

Deep learning shows great potential for the domain of digital pathology....

On the pitfalls of entropy-based uncertainty for multi-class semi-supervised segmentation

Semi-supervised learning has emerged as an appealing strategy to train d...

Efficient Action Recognition Using Confidence Distillation

Modern neural networks are powerful predictive models. However, when it ...

Semi-supervised Learning for Segmentation of Bleeding Regions in Video Capsule Endoscopy

In the realm of modern diagnostic technology, video capsule endoscopy (V...

Please sign up or login with your details

Forgot password? Click here to reset