Assessing Neural Referential Form Selectors on a Realistic Multilingual Dataset

10/10/2022
by   Guanyi Chen, et al.
0

Previous work on Neural Referring Expression Generation (REG) all uses WebNLG, an English dataset that has been shown to reflect a very limited range of referring expression (RE) use. To tackle this issue, we build a dataset based on the OntoNotes corpus that contains a broader range of RE use in both English and Chinese (a language that uses zero pronouns). We build neural Referential Form Selection (RFS) models accordingly, assess them on the dataset and conduct probing experiments. The experiments suggest that, compared to WebNLG, OntoNotes is better for assessing REG/RFS models. We compare English and Chinese RFS and confirm that, in line with linguistic theories, Chinese RFS depends more on discourse context than English.

READ FULL TEXT
research
08/30/2018

Chinese Discourse Segmentation Using Bilingual Discourse Commonality

Discourse segmentation aims to segment Elementary Discourse Units (EDUs)...
research
07/04/2023

CARE-MI: Chinese Benchmark for Misinformation Evaluation in Maternity and Infant Care

The recent advances in NLP, have led to a new trend of applying LLMs to ...
research
03/09/2020

Shallow Discourse Annotation for Chinese TED Talks

Text corpora annotated with language-related properties are an important...
research
11/18/2017

Is China Entering WTO or shijie maoyi zuzhi--a Corpus Study of English Acronyms in Chinese Newspapers

This is one of the first studies that quantitatively examine the usage o...
research
06/07/2023

A New Dataset and Empirical Study for Sentence Simplification in Chinese

Sentence Simplification is a valuable technique that can benefit languag...
research
04/19/2018

A Predictive Model for Notional Anaphora in English

Notional anaphors are pronouns which disagree with their antecedents' gr...
research
08/06/2020

Studying Politeness across Cultures Using English Twitter and Mandarin Weibo

Modeling politeness across cultures helps to improve intercultural commu...

Please sign up or login with your details

Forgot password? Click here to reset