Knowledge Distillation (KD) is a promising technique for reducing the high...
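The snippet above introduces KD only in general terms. For reference, a minimal sketch of the classic logit-distillation objective (soft teacher targets blended with hard labels; the temperature `T` and mixing weight `alpha` are illustrative defaults, not values taken from the paper):

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Blend soft-label KL distillation with hard-label cross-entropy."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),   # student log-probs, softened
        F.softmax(teacher_logits / T, dim=-1),       # teacher probs, softened
        reduction="batchmean",
    ) * (T * T)                                      # rescale gradients by T^2
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# Toy usage: batch of 4, output space of 10 classes/tokens.
s = torch.randn(4, 10, requires_grad=True)
t = torch.randn(4, 10)
y = torch.randint(0, 10, (4,))
print(kd_loss(s, t, y))
```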
In-context learning, where pre-trained language models learn to perform ...
Large language models have exhibited intriguing in-context learning capabilities...
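Both in-context learning snippets describe the same mechanism: the model is never updated, it simply conditions on labeled demonstrations placed before the test input. A minimal sketch of how such a prompt is assembled (the "Review/Sentiment" template is a hypothetical convention, not one from these papers):

```python
# Demonstrations are concatenated ahead of the test input; a frozen
# language model then completes the pattern.
demonstrations = [
    ("The movie was fantastic.", "positive"),
    ("I wasted two hours of my life.", "negative"),
]
test_input = "A touching story with great acting."

prompt = ""
for text, label in demonstrations:
    prompt += f"Review: {text}\nSentiment: {label}\n\n"
prompt += f"Review: {test_input}\nSentiment:"

print(prompt)  # fed as-is to the model; no gradient updates involved
```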
Training language models to learn from human instructions for zero-shot ...
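A minimal sketch of what "learning from human instructions" means in data terms: each labeled example is recast as an instruction-plus-input source sequence with the answer as the supervised target. The template wording below is hypothetical; real instruction-tuning datasets define many such templates per task:

```python
def to_instruction_example(instruction, input_text, answer):
    """Cast one labeled example into instruction-following format."""
    source = f"{instruction}\n\nInput: {input_text}\nAnswer:"
    target = f" {answer}"
    return source, target  # (model input, supervised completion)

src, tgt = to_instruction_example(
    "Classify the sentiment of the review as positive or negative.",
    "The soundtrack alone is worth the ticket.",
    "positive",
)
print(src)
print(tgt)
```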
In this work, we formulate Text Classification as a Matching problem between...
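A minimal sketch of the text-label matching formulation: embed the input text and each label description in a shared space and predict the nearest label. The bag-of-words encoder below is a toy stand-in for the PLM encoder a real system would use:

```python
import math
from collections import Counter

def encode(text):
    # Toy bag-of-words "embedding"; a real system would encode with a PLM.
    return Counter(w.strip(".,!?") for w in text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b.get(w, 0) for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Each label is represented by a short textual description.
labels = {
    "sports": "sports game team player match score",
    "politics": "politics government election policy vote",
}
text = "The team won the match with a last-minute score."
vec = encode(text)
best = max(labels, key=lambda name: cosine(vec, encode(labels[name])))
print(best)  # -> "sports"
```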
Large-scale pre-training has shown remarkable performance in building open-domain...
Prompts for pre-trained language models (PLMs) have shown remarkable performance...
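One widely used instance of prompting PLMs is soft prompt tuning: a few trainable prompt vectors are prepended to the token embeddings while the PLM itself stays frozen. A minimal sketch (the tiny embedding layer stands in for a frozen PLM; all sizes are illustrative):

```python
import torch
import torch.nn as nn

class SoftPromptedEncoder(nn.Module):
    def __init__(self, vocab_size=100, hidden=16, prompt_len=5):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)   # stand-in for the PLM
        self.embed.weight.requires_grad = False         # PLM stays frozen
        self.prompt = nn.Parameter(torch.randn(prompt_len, hidden))  # only this trains

    def forward(self, token_ids):                       # token_ids: (batch, seq)
        tok = self.embed(token_ids)                     # (batch, seq, hidden)
        pfx = self.prompt.unsqueeze(0).expand(tok.size(0), -1, -1)
        return torch.cat([pfx, tok], dim=1)             # prompt vectors prepended

model = SoftPromptedEncoder()
out = model(torch.randint(0, 100, (2, 7)))
print(out.shape)  # torch.Size([2, 12, 16])
```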
Although pre-trained language models have remarkably enhanced the generation...
Pre-trained Language Models (PLMs) have proven to be beneficial for various...
Recently, pre-trained language models mostly follow the pre-training-then-fine-tuning...
Multi-hop knowledge graph (KG) reasoning is an effective and explainable...
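A minimal sketch of why multi-hop KG reasoning is explainable: a prediction can be read off as an explicit chain of relations from the query entity to the answer. The toy triple store and breadth-first path enumeration below are illustrative, not the snippet's actual method:

```python
from collections import deque

triples = [
    ("alice", "born_in", "paris"),
    ("paris", "capital_of", "france"),
    ("bob", "born_in", "lyon"),
    ("lyon", "located_in", "france"),
]
graph = {}
for h, r, t in triples:
    graph.setdefault(h, []).append((r, t))

def reason(start, max_hops=2):
    """Enumerate (relation path, entity) pairs reachable within max_hops."""
    results = []
    queue = deque([(start, [])])
    while queue:
        node, path = queue.popleft()
        if len(path) == max_hops:
            continue
        for rel, nxt in graph.get(node, []):
            results.append((path + [rel], nxt))
            queue.append((nxt, path + [rel]))
    return results

for path, entity in reason("alice"):
    print(" -> ".join(path), "=>", entity)
# e.g. born_in -> capital_of => france  (a human-readable 2-hop chain)
```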