Representing Numbers in NLP: a Survey and a Vision

03/24/2021
by Avijit Thawani, et al.

NLP systems rarely give special consideration to numbers found in text. This contrasts starkly with the consensus in neuroscience that, in the brain, numbers are represented differently from words. We arrange recent NLP work on numeracy into a comprehensive taxonomy of tasks and methods. We break down the subjective notion of numeracy into 7 subtasks, arranged along two dimensions: granularity (exact vs. approximate) and units (abstract vs. grounded). We analyze the myriad representational choices made by 18 previously published number encoders and decoders. We synthesize best practices for representing numbers in text and articulate a vision for holistic numeracy in NLP, comprising design trade-offs and a unified evaluation.

