Agents, LLMs & Multihop Question Answering

Within a development framework, Agents can have access to a suite of tools and, based on the user input and demands, decide which tool to use. Where predetermined chains do not exist, Agents can still service the user request.

Cobus Greyling
5 min read · Apr 21, 2023


Setting The Scene

Accessing and developing on Large Language Models (LLMs) is an ever-expanding field…

However, there are a few emerging principles which are being widely implemented.

This image shows the current development stack for LLMs and the development affordances available.

Considering the image above, the four layers just above the LLM are all directly related to different approaches and implementations of prompts.

There has been an evolution in prompt implementation methods.

Prompt chaining is the process of chaining or sequencing a number of prompts to create a larger application. The prompt sequences can be arranged in series or parallel.
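As a rough sketch of chaining in series, the output of one prompt simply becomes part of the next prompt. Here call_llm is a hypothetical placeholder for whichever LLM API is being used:

def call_llm(prompt: str) -> str:
    # Hypothetical placeholder for an LLM call (a hosted API or a local model).
    raise NotImplementedError

def summarise_then_translate(document: str) -> str:
    # Prompt 1: summarise the document.
    summary = call_llm(f"Summarise the following text in three sentences:\n{document}")
    # Prompt 2: the summary is chained into the next prompt.
    return call_llm(f"Translate the following summary into French:\n{summary}")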

But one of the impediments of prompt chaining is that a chain is a predetermined flow or sequence of calls to LLMs and other APIs.

What about potential unknown scenarios or user behaviour for which no chain has been developed? Can there be a level of autonomy, where the LLM-based application can decide which route will service the user request best? Enter Agents.

Consider an Agent which has access to an extractive model (a prompt pipeline), trained on a corpus of data containing information on US presidents…

The extractive model, based on the document store, does well on the following straightforward question:

Who was the 1st president of the USA?
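As a minimal sketch of such an extractive pipeline (assuming Haystack 1.x, which is also used for the Agent later on; the documents and model name are purely illustrative):

from haystack.document_stores import InMemoryDocumentStore
from haystack.nodes import BM25Retriever, FARMReader
from haystack.pipelines import ExtractiveQAPipeline

# Illustrative document store holding a few facts about US presidents.
document_store = InMemoryDocumentStore(use_bm25=True)
document_store.write_documents([
    {"content": "George Washington was the first president of the United States."},
    {"content": "George Washington was born in 1732 in Virginia."},
])

retriever = BM25Retriever(document_store=document_store)
reader = FARMReader(model_name_or_path="deepset/roberta-base-squad2")
presidents_qa = ExtractiveQAPipeline(reader=reader, retriever=retriever)

result = presidents_qa.run(
    query="Who was the 1st president of the USA?",
    params={"Retriever": {"top_k": 5}, "Reader": {"top_k": 1}},
)
print(result["answers"][0].answer)  # e.g. "George Washington"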

But an Agent relying solely on an extractive model does not do well at finding answers to a question which does not clearly match a phrase in the document store.

In the example below, an ambiguous question is posed to the Agent:

What year was the 1st president of the USA born?

This can be considered a multihop question, which demands a level of deduction and reasoning.

If the Agent has access to the document store (extractive model) and an LLM, the Agent can decide which tool from its arsenal to use to best service the user request.

Leveraging the LLM, the Agent can also follow a multihop approach, using chain-of-thought reasoning to answer the question.
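A minimal sketch of such an Agent in Haystack 1.x wraps the LLM in a PromptNode and registers the extractive pipeline as a tool (the model name and API key are placeholders):

from haystack.agents import Agent, Tool
from haystack.nodes import PromptNode

# The PromptNode wraps the LLM the Agent uses for its chain-of-thought reasoning.
prompt_node = PromptNode(
    model_name_or_path="gpt-3.5-turbo",
    api_key="YOUR_OPENAI_API_KEY",
    stop_words=["Observation:"],
)
agent = Agent(prompt_node=prompt_node)

# The extractive pipeline from earlier becomes a tool the Agent can decide to call.
agent.add_tool(Tool(
    name="USA_Presidents_QA",
    pipeline_or_node=presidents_qa,
    description="useful for when you need to answer questions related to the presidents of the USA.",
    output_variable="answers",
))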

Below, the question is posed to the Haystack Agent; in its response, notice how the question is decomposed and how a Chain-of-Thought prompting approach is followed.
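Invoking the Agent looks roughly like this (continuing the sketch above); as it works, the Agent prints its intermediate reasoning steps, tool calls and observations:

# The Agent decomposes the question, calls tools, and prints its reasoning transcript.
result = agent.run(query="What year was the 1st president of the USA born?")
print(result["answers"][0].answer)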


Finally

In the case of the Haystack Agent, the exact wording of the description is really important. The Agent leverages this description to understand which tool to use.

If the agent fails to pick the right tool, you can adjust the description.

I changed the description from:

useful for when you need to answer questions related to the presidents of the USA.

to:

useful for when you need to answer questions related to the presidents of the USA when there is no answer in the document store.

Below is the code snippet…

from haystack.agents import Tool

# The tool's description is what the Agent uses to decide when to call it.
search_tool = Tool(
    name="USA_Presidents_QA",
    pipeline_or_node=presidents_qa,
    description="useful for when you need to answer questions related to the presidents of the USA when there is no answer in the document store.",
    output_variable="answers",
)
agent.add_tool(search_tool)

When considering LLM-based applications, it is evident that orchestration will play a large role, together with an appropriate balance between prompt pipelines, prompt chaining and Agents.

⭐️ Please follow me on LinkedIn for updates on Conversational AI ⭐️

I’m currently the Chief Evangelist @ HumanFirst. I explore and write about all things at the intersection of AI and language; ranging from LLMs, Chatbots, Voicebots, Development Frameworks, Data-Centric latent spaces and more.

https://www.linkedin.com/in/cobusgreyling
