While large language models (LLMs) have demonstrated remarkable capabili...
Sentence embedding is one of the most fundamental tasks in Natural Langu...
Most named entity recognition (NER) systems focus on improving model
per...
This paper aims to improve contrastive learning for sentence embeddings ...
Static word embedding is still useful, particularly for context-unavaila...
Generating proper embedding of sentences through an unsupervised way is
...
Large language models have demonstrated surprising ability to perform
in...
To protect user privacy and meet legal regulations, federated learning (...
Textual logical reasoning, especially question answering (QA) tasks with...
Probing is popular to analyze whether linguistic information can be capt...
Current practices in metric evaluation focus on one single dataset, e.g....
Structured prediction models aim at solving a type of problem where the
...
Paraphrase generation is an important NLP task that has achieved signifi...
Recently, retrieval-augmented text generation attracted increasing atten...
Recently, it has been shown that natural language processing (NLP) model...
In many situations (e.g., distant supervision), unlabeled entity problem...
Computer-aided translation (CAT), the use of software to assist a human
...
Automatic machine translation is super efficient to produce translations...
Prior work has proved that Translation memory (TM) can boost the perform...
An important aspect of developing dialogue systems is how to evaluate an...
Prior methods to text segmentation are mostly at token level. Despite th...
This technique report introduces TexSmart, a text understanding system t...
In many scenarios, named entity recognition (NER) models severely suffer...
In this work, we present Lexical Unit Analysis (LUA), a framework for ge...
Many efforts have been devoted to extracting constituency trees from
pre...
Recently many efforts have been devoted to interpreting the black-box NM...
Despite the great success of NMT, there still remains a severe challenge...
Generalization to unseen instances is our eternal pursuit for all data-d...
Context gates are effective to control the contributions from the source...
Lexically constrained decoding for machine translation has shown to be
b...
Comments of online articles provide extended views and improve user
enga...
The attention mechanisim is appealing for neural machine translation, si...