Faisal Ahmed | DeepAI

Featured Co-authors

Jianfeng Gao
241 publications
Yu Cheng
139 publications
Zhe Gan
102 publications
Zicheng Liu
81 publications
Jingjing Liu
74 publications
Yun-Nung Chen
67 publications
Lijuan Wang
65 publications
Jianfeng Wang
64 publications
Lihong Li
62 publications
Michael Zeng
51 publications
Kevin Lin
46 publications

research

∙ 03/20/2023

MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action

We propose MM-REACT, a system paradigm that integrates ChatGPT with a po...

Zhengyuan Yang, et al.

research

∙ 11/25/2021

SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning

The canonical approach to video captioning dictates a caption generation...

Kevin Lin, et al.

research

∙ 11/23/2021

Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling

In this paper, we propose UNICORN, a vision-language (VL) model that uni...

Zhengyuan Yang, et al.

research

∙ 09/25/2019

UNITER: Learning UNiversal Image-TExt Representations

Joint image-text embedding is the bedrock for most Vision-and-Language (...

Yen-Chun Chen, et al.

research

∙ 11/15/2017

BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems

We present a new algorithm that significantly improves the efficiency of...

Zachary Lipton, et al.

research

∙ 09/03/2016

Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access

This paper proposes KB-InfoBot -- a multi-turn dialogue agent which help...

Bhuwan Dhingra, et al.
