Empowering Cross-lingual Abilities of Instruction-tuned Large Language Models by Translation-following demonstrations

08/27/2023
by   Leonardo Ranaldi, et al.
0

The language ability of Large Language Models (LLMs) is often unbalanced towards English because of the imbalance in the distribution of the pre-training data. This disparity is demanded in further fine-tuning and affecting the cross-lingual abilities of LLMs. In this paper, we propose to empower Instructiontuned LLMs (It-LLMs) in languages other than English by building semantic alignment between them. Hence, we propose CrossAlpaca, an It-LLM with cross-lingual instruction-following and Translation-following demonstrations to improve semantic alignment between languages. We validate our approach on the multilingual Question Answering (QA) benchmarks XQUAD and MLQA and adapted versions of MMLU and BBH. Our models, tested over six different languages, outperform the It-LLMs tuned on monolingual data. The final results show that instruction tuning on non-English data is not enough and that semantic alignment can be further improved by Translation-following demonstrations.

READ FULL TEXT

page 5

page 13

research
08/09/2023

Extrapolating Large Language Models to Non-English by Aligning Languages

Due to the unbalanced training data distribution, the language ability o...
research
05/23/2023

Instruct-Align: Teaching Novel Languages with to LLMs through Alignment-based Cross-Lingual Instruction

Instruction-tuned large language models (LLMs) have shown remarkable gen...
research
06/19/2023

BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models

Large language models (LLMs) have demonstrated remarkable prowess in lan...
research
12/06/2016

Cross-Lingual Predicate Mapping Between Linked Data Ontologies

Ontologies in different natural languages often differ in quality in ter...
research
04/19/2023

A Latent Space Theory for Emergent Abilities in Large Language Models

Languages are not created randomly but rather to communicate information...
research
09/16/2023

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

Foundational large language models (LLMs) can be instruction-tuned to de...
research
06/06/2019

Cross-Lingual Training for Automatic Question Generation

Automatic question generation (QG) is a challenging problem in natural l...

Please sign up or login with your details

Forgot password? Click here to reset