Wanrong Zhu

research

∙ 09/14/2023

Approximate co-sufficient sampling with regularization

In this work, we consider the problem of goodness-of-fit (GoF) testing f...

0 Wanrong Zhu, et al. ∙

research

∙ 08/12/2023

VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use

We introduce VisIT-Bench (Visual InsTruction Benchmark), a benchmark for...

0 Yonatan Bitton, et al. ∙

research

∙ 08/02/2023

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models

We introduce OpenFlamingo, a family of autoregressive vision-language mo...

0 Anas Awadalla, et al. ∙

research

∙ 07/13/2023

Weighted Averaged Stochastic Gradient Descent: Asymptotic Normality and Optimality

Stochastic Gradient Descent (SGD) is one of the simplest and most popula...

0 Ziyang Wei, et al. ∙

research

∙ 07/12/2023

VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View

Incremental decision making in real-world environments is one of the mos...

0 Raphael Schumann, et al. ∙

research

∙ 05/24/2023

LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

Attaining a high degree of user controllability in visual generation oft...

6 Weixi Feng, et al. ∙

research

∙ 05/18/2023

Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation

The field of text-to-image (T2I) generation has garnered significant att...

0 Wanrong Zhu, et al. ∙

research

∙ 05/02/2023

Multimodal Procedural Planning via Dual Text-Image Prompting

Embodied agents have achieved prominent performance in following human i...

5 Yujie Lu, et al. ∙

research

∙ 01/27/2023

Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Learning

In recent years, pre-trained large language models have demonstrated rem...

1 Xinyi Wang, et al. ∙

research

∙ 10/11/2022

CLIP also Understands Text: Prompting CLIP for Phrase Understanding

Contrastive Language-Image Pretraining (CLIP) efficiently learns visual ...

6 An Yan, et al. ∙

research

∙ 10/07/2022

Visualize Before You Write: Imagination-Guided Open-Ended Text Generation

Recent advances in text-to-image synthesis make it possible to visualize...

4 Wanrong Zhu, et al. ∙

research

∙ 06/06/2022

Neuro-Symbolic Causal Language Planning with Commonsense Prompting

Language planning aims to implement complex high-level goals by decompos...

0 Yujie Lu, et al. ∙

research

∙ 04/18/2022

Imagination-Augmented Natural Language Understanding

Human brains integrate linguistic and perceptual information simultaneou...

4 Yujie Lu, et al. ∙

research

∙ 04/18/2022

End-to-end Dense Video Captioning as Sequence Generation

Dense video captioning aims to identify the events of interest in an inp...

3 Wanrong Zhu, et al. ∙

research

∙ 06/10/2021

ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation

Automatic evaluations for natural language generation (NLG) conventional...

16 Wanrong Zhu, et al. ∙

research

∙ 03/30/2021

Diagnosing Vision-and-Language Navigation: What Really Matters

Vision-and-language navigation (VLN) is a multimodal task where an agent...

10 Wanrong Zhu, et al. ∙

research

∙ 10/07/2020

Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

A major challenge in visually grounded language generation is to build r...

0 Wanrong Zhu, et al. ∙

research

∙ 07/01/2020

Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation

In the vision-and-language navigation (VLN) task, an agent follows natur...

0 Wanrong Zhu, et al. ∙

research

∙ 02/10/2020

A Fully Online Approach for Covariance Matrices Estimation of Stochastic Gradient Descent Solutions

Stochastic gradient descent (SGD) algorithm is widely used for parameter...

10 Wanrong Zhu, et al. ∙

research

∙ 01/01/2019

Text Infilling

Recent years have seen remarkable progress of text generation in differe...

0 Wanrong Zhu, et al. ∙

research

∙ 09/04/2018

Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation

We introduce Texar, an open-source toolkit aiming to support the broad s...

0 Zhiting Hu, et al. ∙

Wanrong Zhu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro