Predicting the Performance of Multilingual NLP Models

10/17/2021
by   Anirudh Srinivasan, et al.
0

Recent advancements in NLP have given us models like mBERT and XLMR that can serve over 100 languages. The languages that these models are evaluated on, however, are very few in number, and it is unlikely that evaluation datasets will cover all the languages that these models support. Potential solutions to the costly problem of dataset creation are to translate datasets to new languages or use template-filling based techniques for creation. This paper proposes an alternate solution for evaluating a model across languages which make use of the existing performance scores of the model on languages that a particular task has test sets for. We train a predictor on these performance scores and use this predictor to predict the model's performance in different evaluation settings. Our results show that our method is effective in filling the gaps in the evaluation for an existing set of languages, but might require additional improvements if we want it to generalize to unseen languages.

READ FULL TEXT

page 5

page 9

page 10

research
05/12/2022

Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages

Although recent Massively Multilingual Language Models (MMLMs) like mBER...
research
04/16/2021

Optimal Size-Performance Tradeoffs: Weighing PoS Tagger Models

Improvement in machine learning-based NLP performance are often presente...
research
03/29/2023

Summarizing Indian Languages using Multilingual Transformers based Models

With the advent of multilingual models like mBART, mT5, IndicBART etc., ...
research
05/02/2020

Predicting Performance for Natural Language Processing Tasks

Given the complexity of combinations of tasks, languages, and domains in...
research
06/19/2019

Unification of Template-Expansion and XML-Validation

The processing of XML documents often includes creation and validation. ...
research
05/25/2023

Morphological Inflection: A Reality Check

Morphological inflection is a popular task in sub-word NLP with both pra...
research
10/17/2020

PPL Bench: Evaluation Framework For Probabilistic Programming Languages

We introduce PPL Bench, a new benchmark for evaluating Probabilistic Pro...

Please sign up or login with your details

Forgot password? Click here to reset