HIVE: Harnessing Human Feedback for Instructional Visual Editing

03/16/2023
by   Shu Zhang, et al.
1

Incorporating human feedback has been shown to be crucial to align text generated by large language models to human preferences. We hypothesize that state-of-the-art instructional image editing models, where outputs are generated based on an input image and an editing instruction, could similarly benefit from human feedback, as their outputs may not adhere to the correct instructions and preferences of users. In this paper, we present a novel framework to harness human feedback for instructional visual editing (HIVE). Specifically, we collect human feedback on the edited images and learn a reward function to capture the underlying user preferences. We then introduce scalable diffusion model fine-tuning methods that can incorporate human preferences based on the estimated reward. Besides, to mitigate the bias brought by the limitation of data, we contribute a new 1M training dataset, a 3.6K reward dataset for rewards learning, and a 1K evaluation dataset to boost the performance of instructional image editing. We conduct extensive empirical experiments quantitatively and qualitatively, showing that HIVE is favored over previous state-of-the-art instructional image editing approaches by a large margin.

READ FULL TEXT

page 1

page 2

page 6

page 7

page 8

page 12

page 13

page 17

research
11/17/2022

InstructPix2Pix: Learning to Follow Image Editing Instructions

We propose a method for editing images from human instructions: given an...
research
09/20/2023

XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates

Text editing is a crucial task that involves modifying text to better al...
research
06/05/2019

Visual Story Post-Editing

We introduce the first dataset for human edits of machine-generated visu...
research
05/24/2023

Analyzing Influential Factors in Human Preference Judgments via GPT-4

Pairwise human judgments are pivotal in guiding large language models (L...
research
05/04/2023

ChatGPT-steered Editing Instructor for Customization of Abstractive Summarization

Tailoring outputs of large language models, such as ChatGPT, to specific...
research
07/20/2023

OBJECT 3DIT: Language-guided 3D-aware Image Editing

Existing image editing tools, while powerful, typically disregard the un...
research
05/17/2021

SHARE: a System for Hierarchical Assistive Recipe Editing

We introduce SHARE: a System for Hierarchical Assistive Recipe Editing t...

Please sign up or login with your details

Forgot password? Click here to reset