An Empirical Study of Discriminative Sequence Labeling Models for Vietnamese Text Processing

08/30/2017
by   Phuong Le-Hong, et al.
0

This paper presents an empirical study of two widely-used sequence prediction models, Conditional Random Fields (CRFs) and Long Short-Term Memory Networks (LSTMs), on two fundamental tasks for Vietnamese text processing, including part-of-speech tagging and named entity recognition. We show that a strong lower bound for labeling accuracy can be obtained by relying only on simple word-based features with minimal hand-crafted feature engineering, of 90.65% and 86.03% performance scores on the standard test sets for the two tasks respectively. In particular, we demonstrate empirically the surprising efficiency of word embeddings in both of the two tasks, with both of the two models. We point out that the state-of-the-art LSTMs model does not always outperform significantly the traditional CRFs model, especially on moderate-sized data sets. Finally, we give some suggestions and discussions for efficient use of sequence labeling models in practical applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/09/2018

Neural sequence labeling for Vietnamese POS Tagging and NER

This paper presents a neural architecture for Vietnamese sequence labeli...
research
11/13/2020

A Survey on Recent Advances in Sequence Labeling from Deep Learning Models

Sequence labeling (SL) is a fundamental research problem encompassing a ...
research
12/21/2016

Sparse Coding of Neural Word Embeddings for Multilingual Sequence Labeling

In this paper we propose and carefully evaluate a sequence labeling fram...
research
09/27/2016

Modelling Radiological Language with Bidirectional Long Short-Term Memory Networks

Motivated by the need to automate medical information extraction from fr...
research
03/30/2021

Locally-Contextual Nonlinear CRFs for Sequence Labeling

Linear chain conditional random fields (CRFs) combined with contextual w...
research
07/17/2018

Bench-Marking Information Extraction in Semi-Structured Historical Handwritten Records

In this report, we present our findings from benchmarking experiments fo...
research
11/21/2016

Learning From Graph Neighborhoods Using LSTMs

Many prediction problems can be phrased as inferences over local neighbo...

Please sign up or login with your details

Forgot password? Click here to reset