Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation

04/11/2021
by Philipp Scharpf, et al.

Mathematical information retrieval (MathIR) applications such as semantic formula search and question answering systems rely on knowledge bases that link mathematical expressions to their natural language names. To populate these knowledge bases, mathematical formulae must be annotated and linked to semantic concepts, which is very time-consuming. In this paper, we present our approach to structuring and speeding up this process by supporting annotators with a system that suggests formula names and meanings of mathematical identifiers. We test our approach by annotating 25 articles on https://en.wikipedia.org. We evaluate the quality and time savings of the annotation recommendations. Moreover, we monitor editor reverts and comments on Wikipedia formula entity links and on Wikidata item creation and population to ground the formula semantics. Our evaluation shows that the AI guidance sped up the annotation process significantly, by a factor of 1.4 for formulae and 2.4 for identifiers. Our contributions were reverted in 12 the Wikidata items within a test window of one month. The »AnnoMathTeX« annotation recommender system is hosted by Wikimedia at https://annomathtex.wmflabs.org. Our data refinement pipeline is ready to be integrated seamlessly into the Wikipedia user interface in the future.

research
11/12/2022

Mining Mathematical Documents for Question Answering via Unsupervised Formula Labeling

The increasing number of questions on Question Answering (QA) platforms ...
research
03/03/2023

Discovery and Recognition of Formula Concepts using Machine Learning

Citation-based Information Retrieval (IR) methods for scientific documen...
research
12/04/2020

ARQMath Lab: An Incubator for Semantic Formula Search in zbMATH Open?

The zbMATH database contains more than 4 million bibliographic entries. ...
research
04/27/2015

Exploring semantically-related concepts from Wikipedia: the case of SeRE

In this paper we present our web application SeRE designed to explore se...
research
06/28/2019

Introducing MathQA – A Math-Aware Question Answering System

We present an open source math-aware Question Answering System based on ...
research
11/13/2018

An Analysis of the Semantic Annotation Task on the Linked Data Cloud

Semantic annotation, the process of identifying key-phrases in texts and...
research
11/17/2022

Data-Efficient Autoregressive Document Retrieval for Fact Verification

Document retrieval is a core component of many knowledge-intensive natur...
