Corrective RAG (CRAG)

By now, RAG is an accepted and well established standard for addressing data relevance for in-context learning. But there are concerns regarding model behaviour when inaccurate data is retrieved.

5 min readFeb 5, 2024

Introduction

There are a number of developments taking place in terms of RAG.

The one exciting development is that of Agentic RAG. Agentic RAG is where a hierarchy of agents are use to perform RAG; this method allows for a multilayered approach where searches are performed across document sources. Various sources are synthesised into one succinct and concise response.

There has also been numerous studies published considering the triage of retrieved information together with in-context learning.

Corrective Retrieval Augmented Generation (CRAG) is proposed to enhance the robustness of generation when errors in retrieval are introduced.

Part of CRAG, is a lightweight retrieval evaluator which assesses the overall quality of retrieved documents, providing a confidence degree to trigger different knowledge retrieval actions.

Also, to address limitations in static and limited corpora, large-scale web searches are used to augment retrieval results.

CRAG employs a decompose-then-recompose algorithm for retrieved documents, allowing selective focus on key information and filtering out irrelevant details.

CRAG is designed to be plug-and-play and seamlessly integrate with various RAG-based approaches.

Experiments conducted during the study using four datasets covering short- and long-form generation tasks show that CRAG significantly enhances the performance of RAG-based approaches.

CRAG Overview

Considering the diagram below, a retrieval evaluator is constructed to evaluate the relevance of the retrieved documents to the input. An estimation of the degree of confidence is made based on which different knowledge retrieval actions of Correct, Incorrect, Ambiguous can be triggered.

The proposed method is named Corrective Retrieval-Augmented Generation (CRAG), and it is aimed to self-correct retriever results and enhance document utilisation for generation.

A lightweight retrieval evaluator is introduced to assess the overall quality of retrieved documents for a given query.

This evaluator is a crucial component in Retrieval-Augmented Generation (RAG), contributing to informative generation by reviewing and evaluating the relevance and reliability of retrieved documents.

The retrieval evaluator quantifies a confidence degree, enabling different knowledge retrieval actions such as Correct, Incorrect, Ambiguous based on the assessment.

For Incorrect and Ambiguous cases, large-scale web searches are integrated strategically to address limitations in static and limited corpora, aiming to provide a broader and more diverse set of information.

Lastly…a decompose-then-recompose algorithm is implemented throughout the retrieval and utilisation process.

This algorithm helps eliminate redundant contexts in retrieved documents that are unhelpful for RAG, refining the information extraction process and optimising the inclusion of key insights while minimising non-essential elements.

Retrieval Evaluator

From the diagram above, it is clear that the accuracy of the retrieval evaluator plays a significant role in determining the performance of the entire system.

This algorithm ensures the refinement of retrieved information, optimising the extraction of key insights and minimizing the inclusion of non-essential elements, thereby enhancing the utilization of retrieved data.

Conclusion

This study focus on challenges faced by Retrieval-Augmented Generation (RAG) approaches when retrieval results are inaccurate, leading to the generation of incorrect knowledge by language models.

The proposed solution is something the researches named Corrective Retrieval Augmented Generation (CRAG).

The research introduces CRAG as a plug-and-play solution to enhance the robustness of generation by mitigating issues arising from inaccurate retrieval.

CRAG incorporates a retrieval evaluator designed to estimate and trigger three different actions.

The inclusion of web search and optimised knowledge utilisation operations in CRAG improves automatic self-correction and enhances the efficient utilisation of retrieved documents.

Experiments combining CRAG with standard RAG and Self-RAG showcase its adaptability to various RAG-based approaches.

The study, conducted on four datasets, illustrates CRAG’s generalisability across both short- and long-form generation tasks.

⭐️ Follow me on LinkedIn for updates on Large Language Models ⭐️

I’m currently the Chief Evangelist @ Kore AI. I explore & write about all things at the intersection of AI & language; ranging from LLMs, Chatbots, Voicebots, Development Frameworks, Data-Centric latent spaces & more.

Get an email whenever Cobus Greyling publishes.

Get an email whenever Cobus Greyling publishes. By signing up, you will create a Medium account if you don’t already…

cobusgreyling.medium.com

Corrective Retrieval Augmented Generation

Large language models (LLMs) inevitably exhibit hallucinations since the accuracy of generated texts cannot be secured…

arxiv.org

UniMS-RAG: Unified Multi-Source RAG for Personalised Dialogue

Considerable development has taken place in the area of RAG, especially in adding structure and multi-document…

cobusgreyling.medium.com

Agentic RAG With LlamaIndex

The topic of Agentic RAG explores how agents can be incorporated into existing RAG pipelines for enhanced…

blog.llamaindex.ai

LLamaIndex Agentic RAG Demo

Agentic RAG is an agent based approach to perform question answering over multiple documents in an orchestrated…

cobusgreyling.medium.com

Corrective RAG (CRAG)

By now, RAG is an accepted and well established standard for addressing data relevance for in-context learning. But there are concerns regarding model behaviour when inaccurate data is retrieved.

Introduction

CRAG Overview

Retrieval Evaluator

Conclusion

Get an email whenever Cobus Greyling publishes.

Get an email whenever Cobus Greyling publishes. By signing up, you will create a Medium account if you don’t already…

Corrective Retrieval Augmented Generation

Large language models (LLMs) inevitably exhibit hallucinations since the accuracy of generated texts cannot be secured…

UniMS-RAG: Unified Multi-Source RAG for Personalised Dialogue

Considerable development has taken place in the area of RAG, especially in adding structure and multi-document…

Agentic RAG With LlamaIndex

The topic of Agentic RAG explores how agents can be incorporated into existing RAG pipelines for enhanced…

LLamaIndex Agentic RAG Demo

Agentic RAG is an agent based approach to perform question answering over multiple documents in an orchestrated…

Sign up to discover human stories that deepen your understanding of the world.

Free

Membership

Written by Cobus Greyling

Responses (1)