We present a scalable method to build a high-quality instruction-followi...
The emergence of generative pre-trained models has facilitated the synth...
Evaluation of natural language generation (NLG) is complex and multi-dim...
Given a prefix (context), open-ended generation aims to decode texts tha...
Large language models are trained in two stages: (1) unsupervised pretra...
Scaling up language models has led to unprecedented performance gains, b...
Large-scale generative models show an impressive ability to perform a wi...
The design choices in the Transformer attention mechanism, including wea...
One of the most impressive results of recent NLP history is the ability ...
Fine-tuning large pre-trained language models on downstream tasks has be...
Multilingual neural machine translation (MNMT) learns to translate multi...
A central goal of machine learning is to learn robust representations th...
The quadratic computational and memory complexities of the Transformer's...
In this paper, we focus on the unsupervised setting for structure learni...
Neural sequence models can generate highly fluent sentences but recent s...
Non-autoregressive machine translation (NAT) systems predict a sequence ...
Most sequence-to-sequence (seq2seq) models are autoregressive; they gene...
Despite impressive empirical successes of neural machine translation (NM...
Recent approaches to cross-lingual word embedding have generally been ba...
This paper describes the ARIEL-CMU submissions to the Low Resource Human...
Variational Autoencoder (VAE), a simple and effective deep generative mo...
Much work in Natural Language Processing (NLP) has been for resource-ric...
Semantic parsing is the task of transducing natural language (NL) uttera...
Labeled sequence transduction is a task of transforming one sequence int...
Neural network models have been demonstrated to be capable of achieving ...
Distributed word representations have been demonstrated to be effective ...