research ∙ 05/04/2023
    What changes when you randomly choose BPE merge operations? Not much
We introduce three simple randomized variants of byte pair encoding (BPE...
          
research ∙ 02/28/2022
    ParaNames: A Massively Multilingual Entity Name Corpus
This preprint describes work in progress on ParaNames, a multilingual pa...
          
research ∙ 02/24/2022
    Toward More Meaningful Resources for Lower-resourced Languages
In this position paper, we describe our perspective on how meaningful re...
          
research ∙ 04/01/2021
    Mining Wikidata for Name Resources for African Languages
This work supports further development of language technology for the la...
          
research ∙ 03/20/2021
     
             
  
  
     