Tuplemax Loss for Language Identification
In many language identification scenarios, the user specifies a small set of languages that they can speak, rather than a large set of all possible languages. We want to build this prior knowledge into the way we train our neural networks by replacing the commonly used softmax loss function with a novel loss function named tuplemax loss. In fact, a typical language identification system launched in North America has about 95% of users who speak no more than two languages. Using the tuplemax loss, our system achieved a 2.33 3.85
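The abstract does not state the formula for the tuplemax loss, so the following is only a minimal sketch under an assumed form: the loss is taken to be the average, over every non-target language k, of the two-class softmax loss between the target language y and k. The function names `softmax_loss` and `tuplemax_loss`, and the use of plain Python lists for the logits, are illustrative choices and do not come from the source.

```python
import math


def softmax_loss(logits, target):
    """Standard softmax cross-entropy: -log(exp(z_y) / sum_k exp(z_k))."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[target]


def tuplemax_loss(logits, target):
    """ASSUMED form of the tuplemax loss (not given in the abstract):
    the mean over k != target of -log(exp(z_y) / (exp(z_y) + exp(z_k)))."""
    if len(logits) < 2:
        raise ValueError("need at least two candidate languages")
    total = 0.0
    for k, z_k in enumerate(logits):
        if k == target:
            continue
        # The pairwise loss equals log(1 + exp(z_k - z_y)), i.e. softplus(d).
        d = z_k - logits[target]
        # Numerically stable softplus: max(d, 0) + log1p(exp(-|d|)).
        total += max(d, 0.0) + math.log1p(math.exp(-abs(d)))
    return total / (len(logits) - 1)
```

Under this assumed form, the two losses coincide when there are exactly two candidate languages, which is the case the abstract highlights; with more candidates, each wrong language is compared against the target separately instead of competing in one shared normalization.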