Descartes: Generating Short Descriptions of Wikipedia Articles

05/20/2022
by   Marija Sakota, et al.
0

We introduce and tackle the problem of automatically generating short descriptions of Wikipedia articles (e.g., Belgium has a short description Country in Western Europe). We introduce Descartes, a model that can generate descriptions performing on par with human editors. Our human evaluation results indicate that Descartes is preferred over editor-written descriptions about 50 of time. Further manual analysis show that Descartes generates descriptions considered as "valid" for 91.3 descriptions. Such performances are made possible by integrating other signals naturally existing in Wikipedia: (i) articles about the same entity in different languages, (ii) existing short descriptions in other languages, and (iii) structural information from Wikidata. Our work has direct practical applications in helping Wikipedia editors to provide short descriptions for the more than 9 million articles still missing one. Finally, our proposed architecture can easily be re-purposed to address other information gaps in Wikipedia.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset