Multi-Speaker Localization Using Convolutional Neural Network Trained with Noise

12/12/2017
by   Soumitro Chakrabarty, et al.
0

The problem of multi-speaker localization is formulated as a multi-class multi-label classification problem, which is solved using a convolutional neural network (CNN) based source localization method. Utilizing the common assumption of disjoint speaker activities, we propose a novel method to train the CNN using synthesized noise signals. The proposed localization method is evaluated for two speakers and compared to a well-known steered response power method.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset