Trapping LLM Hallucinations Using Tagged Context Prompts

06/09/2023
by Philip Feldman, et al.

Recent advances in large language models (LLMs), such as ChatGPT, have led to highly sophisticated conversation agents. However, these models suffer from "hallucinations," where the model generates false or fabricated information. Addressing this challenge is crucial, particularly as AI-driven platforms are adopted across various sectors. In this paper, we propose a novel method to recognize and flag instances when LLMs perform outside their domain knowledge, ensuring that users receive accurate information. We find that the use of context combined with embedded tags can successfully combat hallucinations within generative language models. To do this, we baseline hallucination frequency in no-context prompt-response pairs, using generated URLs as easily tested indicators of fabricated data. We observed a significant reduction in overall hallucination when context was supplied along with question prompts for the tested generative engines. Lastly, we evaluated how placing tags within contexts impacted model responses and were able to eliminate hallucinations in responses with 98.88
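The tagged-context idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `(source N)` tag format, the prompt wording, and the helper names `tag_context`, `build_prompt`, and `check_response` are all assumptions made for illustration. The sketch attaches a numeric tag to each context passage, asks the model to cite those tags, and then checks a response for tags that were never supplied and for URLs that would need separate verification.

```python
import re

# Hypothetical tag format "(source N)"; the paper's exact format may differ.
TAG_PATTERN = re.compile(r"\(source (\d+)\)")
# Simple URL matcher, used only to surface links for later checking.
URL_PATTERN = re.compile(r"https?://[^\s)\]>\"']+")


def tag_context(passages, start_id=1000):
    """Map a numeric tag to each context passage (tags start at start_id)."""
    return {start_id + offset: text for offset, text in enumerate(passages)}


def build_prompt(question, tagged):
    """Assemble a prompt whose context lines each end with their tag."""
    context = "\n".join(f"{text} (source {tag})" for tag, text in tagged.items())
    return (
        "Answer using only the context below and cite the source tag "
        "for every statement.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


def check_response(response, tagged):
    """Report which cited tags exist in the context and which do not."""
    cited = {int(match) for match in TAG_PATTERN.findall(response)}
    known = set(tagged)
    return {
        "valid_tags": sorted(cited & known),
        "unknown_tags": sorted(cited - known),  # tags the context never contained
        "untagged": not cited,                  # no tag at all: treat as suspect
        "urls": URL_PATTERN.findall(response),  # candidates for a reachability check
    }
```

In this sketch, a response with an empty `valid_tags` list or a non-empty `unknown_tags` list would be flagged as possibly generated outside the supplied context; how such responses are handled afterwards is left open.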


research
08/29/2023

Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models

Most open-domain dialogue systems suffer from forgetting important infor...
research
04/04/2023

Geotechnical Parrot Tales (GPT): Harnessing Large Language Models in geotechnical engineering

The widespread adoption of large language models (LLMs), such as OpenAI'...
research
07/20/2023

The Extractive-Abstractive Axis: Measuring Content "Borrowing" in Generative Language Models

Generative language models produce highly abstractive outputs by design,...
research
08/31/2023

LLM in the Shell: Generative Honeypots

Honeypots are essential tools in cybersecurity. However, most of them (e...
research
07/07/2021

Not Quite 'Ask a Librarian': AI on the Nature, Value, and Future of LIS

AI language models trained on Web data generate prose that reflects huma...
research
11/09/2022

Large Language Models with Controllable Working Memory

Large language models (LLMs) have led to a series of breakthroughs in na...
