Evade ChatGPT Detectors via A Single Space

07/05/2023
by Shuyang Cai, et al.

ChatGPT brings revolutionary social value but also raises concerns about the misuse of AI-generated content. Consequently, an important question is how to detect whether content was generated by ChatGPT or by a human. Existing detectors are built on the assumption that there are distributional gaps between human-generated and AI-generated content. These gaps are typically identified using statistical information or classifiers. Our research challenges this distributional gap assumption. We find that detectors do not effectively capture the semantic and stylistic gaps between human-generated and AI-generated content. Instead, subtle differences, such as an extra space, become crucial for detection. Based on this discovery, we propose the SpaceInfi strategy to evade detection. Experiments demonstrate the effectiveness of this strategy across multiple benchmarks and detectors. We also provide a theoretical explanation for why SpaceInfi succeeds in evading perplexity-based detection. Our findings offer new insights and challenges for understanding and constructing more applicable ChatGPT detectors.
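To make the "extra space" idea concrete, here is a minimal Python sketch of a single-space perturbation. The abstract does not say where the space is placed, so inserting it before a randomly chosen comma is an assumption made for illustration, and the function name `add_single_space` is hypothetical rather than taken from the paper.

```python
import random
from typing import Optional


def add_single_space(text: str, seed: Optional[int] = None) -> str:
    """Insert one extra space before a randomly chosen comma.

    Assumption: the abstract only mentions "an extra space"; choosing a
    position before a comma is an illustrative guess, not the paper's
    confirmed procedure. Text without a comma is returned unchanged.
    """
    comma_positions = [i for i, ch in enumerate(text) if ch == ","]
    if not comma_positions:
        return text
    rng = random.Random(seed)  # local generator keeps the choice reproducible
    pos = rng.choice(comma_positions)
    return text[:pos] + " " + text[pos:]


sample = "Detectors rely on distributional gaps, which may be fragile."
perturbed = add_single_space(sample, seed=0)
print(perturbed)
```

The perturbed string differs from the original by exactly one character, which is the kind of subtle difference the abstract argues detectors end up depending on.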

research
05/05/2023

Evading Watermark based Detection of AI-Generated Content

A generative AI model – such as DALL-E, Stable Diffusion, and ChatGPT – ...
research
04/06/2023

GPT detectors are biased against non-native English writers

The rapid adoption of generative language models has brought about subst...
research
04/10/2023

On the Possibilities of AI-Generated Text Detection

Our work focuses on the challenge of detecting outputs generated by Larg...
research
04/04/2023

To ChatGPT, or not to ChatGPT: That is the question!

ChatGPT has become a global sensation. As ChatGPT and other Large Langua...
research
05/18/2023

Large Language Models can be Guided to Evade AI-Generated Text Detection

Large Language Models (LLMs) have demonstrated exceptional performance i...
research
04/11/2023

Evaluating AIGC Detectors on Code Content

Artificial Intelligence Generated Content (AIGC) has garnered considerab...
research
11/09/2018

Securing Behavior-based Opinion Spam Detection

Reviews spams are prevalent in e-commerce to manipulate product ranking ...
