LLM Hallucination Correction via Training-Time Correction, Generation-Time Correction & Augmentation Tools

These methods are not mutually exclusive and can be implemented in parallel for highly scalable enterprise implementations.

Oct 5, 2023


The advent of LLMs, Foundation Models and Generative AI has given rise to a gold rush of sorts, with companies in a mad dash to develop the ultimate product to leverage the power of these models.

This has led to ambitious marketing (to say the least) and a temptation to single out one product or one approach that will solve all LLM implementation challenges.

The reality is that there is no elixir to remedy all implementation challenges; the solution most probably lies in a combination of technologies and principles.

This article covers three identified and accepted building blocks for LLM-based implementations, which can be used alone or in concert.

Training-Time Correction

This approach is focused on the model level, where the model is fine-tuned with custom data to correct hallucination-prone behaviour before inference.
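
As a rough illustration, below is a minimal sketch of what training-time correction could look like using the Hugging Face Trainer to fine-tune a small causal language model on curated, factually verified prompt-and-answer pairs. The model name, the train.jsonl file and the hyperparameters are placeholders rather than recommendations.

```python
# Minimal sketch: supervised fine-tuning of a causal LM on curated,
# factually verified prompt-and-answer pairs (assumed to live in train.jsonl).
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_name = "gpt2"  # placeholder; any causal LM checkpoint works
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Each record: {"prompt": "...", "answer": "..."} with verified answers.
dataset = load_dataset("json", data_files="train.jsonl", split="train")

def tokenize(record):
    text = record["prompt"] + "\n" + record["answer"] + tokenizer.eos_token
    return tokenizer(text, truncation=True, max_length=512)

tokenized = dataset.map(tokenize, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="finetuned-model",
        num_train_epochs=3,
        per_device_train_batch_size=4,
        learning_rate=5e-5,
    ),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```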

Generation-Time Correction

Generation time is also referred to as inference time.

In generation-time correction, a common theme is to layer reasoning decisions on top of the base LLM in order to make its outputs more reliable.

Another promising approach to rectifying these flaws is self-correction, where the LLM itself is prompted or guided to fix problems in its own output.

Techniques leveraging automated feedback, whether produced by the LLM itself or by an external system, are of particular interest, as they are a promising way to make LLM-based solutions more practical and deployable with minimal human feedback.
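
Below is a minimal sketch of such a self-correction loop driven by automated feedback. The llm() helper is a hypothetical stand-in for whatever chat-completion call your stack exposes, and the prompt wording is purely illustrative.

```python
# Minimal self-correction loop: generate, critique, revise.
# llm() is a hypothetical helper wrapping your chat-completion API.

def llm(prompt: str) -> str:
    raise NotImplementedError("Replace with a call to your LLM provider.")

def self_correct(question: str, max_rounds: int = 2) -> str:
    answer = llm(f"Answer the question concisely.\n\nQuestion: {question}")
    for _ in range(max_rounds):
        # Automated feedback: the model critiques its own draft.
        critique = llm(
            "List any factual errors or unsupported claims in this answer. "
            "Reply with NONE if there are none.\n\n"
            f"Question: {question}\nAnswer: {answer}"
        )
        if critique.strip().upper().startswith("NONE"):
            break
        # Revision step: fix only the problems identified in the critique.
        answer = llm(
            "Rewrite the answer so that it fixes the issues listed below, "
            "without adding new unsupported claims.\n\n"
            f"Question: {question}\nAnswer: {answer}\nIssues: {critique}"
        )
    return answer
```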

Chain-of-Verification (CoVe) is one such technique: the model drafts a response, plans verification questions to check it, answers those questions independently, and then produces a final verified response. CoVe also uses a related self-consistency approach, but without the multi-agent (multi-LLM) debate concept.
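
To make the idea concrete, here is a rough sketch of a CoVe-style flow, reusing the hypothetical llm() helper from the previous sketch. The prompt wording and the choice of three verification questions are assumptions for illustration, not the method exactly as published.

```python
# Rough Chain-of-Verification (CoVe) sketch: draft, verify, revise.
# Reuses the hypothetical llm() helper defined in the previous example.

def chain_of_verification(question: str) -> str:
    # 1. Baseline draft answer.
    draft = llm(f"Answer the question.\n\nQuestion: {question}")

    # 2. Plan verification questions about claims made in the draft.
    plan = llm(
        "Write three short fact-checking questions that would verify the "
        f"claims in this answer, one per line.\n\nAnswer: {draft}"
    )

    # 3. Answer each verification question independently of the draft,
    #    so the model cannot simply repeat its own mistakes.
    checks = []
    for verification_question in plan.splitlines():
        if verification_question.strip():
            checks.append(
                verification_question + " -> " + llm(verification_question)
            )

    # 4. Produce the final answer, conditioned on the verification results.
    return llm(
        "Revise the draft answer so it is consistent with the verification "
        "results below.\n\n"
        f"Question: {question}\nDraft: {draft}\nVerifications:\n"
        + "\n".join(checks)
    )
```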

Augmentation Tools

A third approach is to use external tools to help mitigate hallucinations, rather than relying solely on the abilities of the language model itself.

For example, retrieval-augmented generation can decrease hallucinations by using factual documents for grounding or for chain-of-thought verification.
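
In its simplest form this amounts to retrieve, then prompt with the retrieved context. The sketch below assumes a hypothetical retrieve() function over your own document store, together with the same hypothetical llm() helper used earlier, and asks the model to cite the passages it relies on.

```python
# Minimal retrieval-augmented generation (RAG) sketch:
# ground the answer in retrieved documents and ask the model to cite them.

def retrieve(query: str, k: int = 3) -> list[str]:
    # Hypothetical: replace with your vector store / search index lookup.
    raise NotImplementedError("Replace with your document retrieval call.")

def grounded_answer(question: str) -> str:
    passages = retrieve(question)
    context = "\n\n".join(
        f"[{i + 1}] {passage}" for i, passage in enumerate(passages)
    )
    return llm(
        "Answer using ONLY the numbered passages below and cite them "
        "like [1]. If the passages do not contain the answer, say so.\n\n"
        f"Passages:\n{context}\n\nQuestion: {question}"
    )
```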

Other approaches include using tools for fact-checking or linking to external documents with attribution.

A majority of the methods for reducing hallucination can be divided into roughly three categories: training-time correction, generation-time correction and via augmentation (tool-use). ~ Source

