Design and implementation of an open source Greek POS Tagger and Entity Recognizer using spaCy

12/05/2019
by Eleni Partalidou, et al.

This paper proposes a machine learning approach to part-of-speech tagging and named entity recognition for Greek, focusing on the extraction of morphological features and the classification of tokens into a small set of named entity classes. The model architecture that was used is introduced. Greek language support was added to the spaCy source code, a feature that did not exist before our contribution, and was used to build the models. Additionally, a part-of-speech tagger was trained that can detect the morphology of tokens and outperforms state-of-the-art results when classifying only the part of speech. For named entity recognition with spaCy, a model was built that extends the standard ENAMEX types (organization, location, person). Experiments indicate the need for more flexible handling of out-of-vocabulary words, and an effort is made to resolve this issue. Finally, the evaluation results are discussed.
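
To illustrate the out-of-vocabulary issue mentioned in the abstract, the following minimal sketch measures the share of tokens in a text that are missing from a known vocabulary. The function name `oov_rate`, the whitespace tokenization, and the toy vocabulary are assumptions made for illustration only; they are not taken from the paper, which relies on spaCy's own tokenizer and models.

```python
def oov_rate(tokens, vocabulary):
    """Return the fraction of tokens not found in the vocabulary.

    Comparison is case-insensitive. An empty token list yields 0.0.
    """
    if not tokens:
        return 0.0
    vocab = {word.lower() for word in vocabulary}
    missing = [tok for tok in tokens if tok.lower() not in vocab]
    return len(missing) / len(tokens)


# Hypothetical toy vocabulary (not data from the paper).
vocabulary = ["η", "αθήνα", "είναι", "πρωτεύουσα", "της", "ελλάδας"]

# Naive whitespace tokenization, assumed here for simplicity.
known_tokens = "Η Αθήνα είναι η πρωτεύουσα της Ελλάδας".split()
mixed_tokens = "Η Θεσσαλονίκη είναι πόλη".split()

known_rate = oov_rate(known_tokens, vocabulary)
mixed_rate = oov_rate(mixed_tokens, vocabulary)
```

In this toy setup every token of the first sentence is in the vocabulary, while two of the four tokens in the second sentence are not. A high rate on real text would suggest that a tagger or entity recognizer depending on word-level lookups may struggle with that text.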

research
09/12/2015

Kannada named entity recognition and classification (nerc) based on multinomial naïve bayes (mnb) classifier

Named Entity Recognition and Classification (NERC) is a process of ident...
research
06/29/2020

Improving Sequence Tagging for Vietnamese Text Using Transformer-based Neural Models

This paper describes our study on using multilingual BERT embeddings and ...
research
10/18/2016

Vietnamese Named Entity Recognition using Token Regular Expressions and Bidirectional Inference

This paper describes an efficient approach to improve the accuracy of a ...
research
12/30/2021

KIND: an Italian Multi-Domain Dataset for Named Entity Recognition

In this paper we present KIND, an Italian dataset for Named-Entity Recog...
research
10/29/2020

May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance

We investigate using Named Entity Recognition on a new type of user-gene...
research
07/01/2022

Multi-features based Semantic Augmentation Networks for Named Entity Recognition in Threat Intelligence

Extracting cybersecurity entities such as attackers and vulnerabilities ...
research
04/05/2022

LAMNER: Code Comment Generation Using Character Language Model and Named Entity Recognition

Code comment generation is the task of generating a high-level natural l...
