In this work, we use large language models (LLMs) to augment and acceler...
Reasoning is a cognitive process of using evidence to reach a sound conc...
We uncover a systematic bias in the evaluation paradigm of adopting larg...
Entity linking models have achieved significant success via utilizing pr...
Recently, Pretrained Language Models (PLMs) have been serving as general...
Pretrained language models have achieved remarkable success in a variety...
Video multimodal fusion aims to integrate multimodal signals in videos, ...
Generative Language Models (GLMs) have demonstrated capabilities to stor...
Continual learning (CL) aims to constantly learn new knowledge over time...
Continual relation extraction (CRE) models aim at handling emerging new...
With the increasing ability of large language models (LLMs), in-context...
Large pretrained language models have shown surprising In-Context Learni...
Datasets serve as crucial training resources and model performance track...
Harvesting question-answer (QA) pairs from customer service chatlog in t...
While interacting with chatbots, users may elicit multiple intents in a...
Continual relation extraction (CRE) aims to continually learn new relati...
Previous literature has proved that Pretrained Language Models (PLMs) ca...
Continual relation extraction (CRE) requires the model to continually le...
The ability of pretrained Transformers to remember factual knowledge is...
Fine-tuning pretrained language models (PLMs) on downstream tasks has be...
Most previous studies aim at extracting events from a single sentence, w...
Hierarchical text classification (HTC) is a challenging subtask of multi...
As Abstract Meaning Representation (AMR) implicitly involves compound se...
The Mixture-of-Experts (MoE) technique can scale up the model size of Tr...
Biomedical Question Answering (BQA) has attracted increasing attention i...
In this paper, we propose Generalized Aggressive Decoding (GAD) – a nove...
Abstract Meaning Representation (AMR) parsing translates sentences to th...
Few-Shot Sequence Labeling (FSSL) is a canonical solution for the taggin...
Few-Shot Event Classification (FSEC) aims at developing a model for even...
Aspect Sentiment Triplet Extraction (ASTE) aims to recognize targets, th...
Artificial Intelligence (AI), along with the recent progress in biomedic...
Evaluation in natural language processing guides and promotes research o...
Large pretrained generative models like GPT-3 often suffer from hallucin...
Large-scale pretrained language models are surprisingly good at recallin...
Conventional Machine Reading Comprehension (MRC) has been well-addressed...
In open domain table-to-text generation, we notice that the unfaithful g...
Document-level Relation Extraction (RE) requires extracting relations ex...
The prior work on natural language inference (NLI) debiasing mainly targ...
While discriminative neural network classifiers are generally preferred,...
Conventional Knowledge Graph Completion (KGC) assumes that all test enti...
Many recent studies have shown that for models trained on datasets for n...
While many BERT-based cross-modal pre-trained models produce excellent r...
Learning to navigate in a visual environment following natural language...
In this paper, we focus on the task of generating a pun sentence given a...
Unsupervised text style transfer aims to transfer the underlying style o...
Word Sense Disambiguation (WSD) aims to identify the correct meaning of...
Table-to-text generation aims to generate a description for a factual ta...
Generating texts from structured data (e.g., a table) is important for v...
Previous studies on Chinese semantic role labeling (SRL) have concentrat...
Without discourse connectives, classifying implicit discourse relations ...