Data Delivery To Large Language Models

I believe there are four dimensions to data when it comes to LLMs. In this article I focus on one of those four sides; named Data Delivery.

5 min readOct 27, 2023

Some Background

Data Delivery can best described as the process of imbuing one or more models with data relevant to the use-case, industry and specific user context at inference.

The data is used by the LLM to deliver accurate responses in each and every instance.

Often the various methods of data delivery are considered as mutually exclusive with one approach being considered as the ultimate solution. This point of view is often driven by ignorance, a lack of understanding, organisation searching for a stop-gap solution or a vendor pushing their specific product as the silver bullet.

The truth is that for an enterprise implementation flexibility and manageability will necessitate complexity.

This holds true for any LLM implementation and the approach followed to deliver data to the LLM. The answer is not one specific approach, like RAG, or prompt chaining; but rather a balanced multi-pronged approach.

Model Training

Data can be delivered to a LLM at two stages, during model training (gradient) or at inference time (gradient-free).

Model training creates, changes and adapts the underlying ML model. This model is also referred to as a model frozen in time with a definite time-stamp.

Model training can be again be divided into two sub-categories;

Meta Training & Meta Learning

Meta Training

Meta-training is not something an organisation will perform; generally, I would say. It is rather the process used by model providers to firstly create models and secondly create models of different sizes and different optimisations.

Meta-training typically involves the initial pre-training of a LLM on a massive corpus of text data.

In this phase, the model learns to understand the structure and patterns of language, building a strong foundation of general knowledge and language understanding; going through a pre-training phase where it learns from a wide range of internet text.

The term “meta” in meta-training is used because this training doesn’t involve a specific task or domain adaptation. Instead, it prepares the model to be a versatile language understanding tool that can later be fine-tuned for various tasks or domains. It’s the base training that precedes task-specific learning.

Meta Learning

Meta-learning is the process of fine-tuning an existing LLM for a specific task or domain. This could be considered as the second phase of training, following the meta-training.

During meta-learning, the model is trained on task-specific or domain-specific data and adjusts its parameters to perform well on that specific task or within that specific domain.

Meta-learning can be performed by fine-tuning the model, prompt tuning or a new approach by DeepMind PromptBreeder.

While fine-tuning takes place at a certain point in time, and produces a frozen model which is then referenced over time; approaches like prompt tuning or soft prompts introduce a more flexible and dynamic way of guiding the model.

Instead of a fixed prompt, users provide high-level instructions or hints to guide the model responses.

Inference Time Training

Inference is the moment the LLM is queried and where the model subsequently generates a response. This is also referred to as a gradient-free approach due to the fact that the underlying model is not trained or changed.

Recent research and studies have found that providing context at inference is of utmost importance and various methods are being followed to deliver highly contextual reference data with the prompt to negate hallucination. This is also referred to as prompt injection.

Context and conversational structured can be delivered via RAG, prompt pipelines, Autonomous Agents, Prompt Chaining and prompt engineering techniques.

⭐️ Follow me on LinkedIn for updates on Large Language Models ⭐️

I’m currently the Chief Evangelist @ Kore AI. I explore & write about all things at the intersection of AI & language; ranging from LLMs, Chatbots, Voicebots, Development Frameworks, Data-Centric latent spaces & more.

Get an email whenever Cobus Greyling publishes.

Get an email whenever Cobus Greyling publishes. By signing up, you will create a Medium account if you don’t already…

cobusgreyling.medium.com

LLM Hallucination Correction Via Training-Time Correction, Generation-Time Correction &…

These methods are not mutually exclusive, and can be implemented in parallel for highly scaleable enterprise…

cobusgreyling.medium.com

Large Language Model (LLM) Disruption of Chatbots

To understand the disruption and demands Large Language Models will place on the Conversational AI/UI ecosystem going…

cobusgreyling.medium.com

Updated: Emerging RAG & Prompt Engineering Architectures for LLMs

Large Language Models (LLMs) depend on unstructured data for input and output data is also unstructured and…

cobusgreyling.medium.com

OpenAI GPT-3.5 Turbo Model Fine-Tuning

This article considers the process, speed and data requirements to create a fine-tuned model. On 22 August 2023 OpenAI…

cobusgreyling.medium.com

12 Prompt Engineering Techniques

Prompt Engineering can be described as an art form, creating input requests for Large Language Models (LLMs) that will…

cobusgreyling.medium.com

Steps In Evaluating Retrieval Augmented Generation (RAG) Pipelines

The basic principle of RAG is to leverage external data sources. For each user query or question, a contextual chunk of…

cobusgreyling.medium.com

Mitigating Hallucination

Search engines & applications are attached to data sources. Even-though Large Language Models (LLMs) hold vast amounts…

cobusgreyling.medium.com

Data Delivery To Large Language Models

I believe there are four dimensions to data when it comes to LLMs. In this article I focus on one of those four sides; named Data Delivery.

Some Background

Model Training

Meta Training & Meta Learning

Meta Training

Meta Learning

Inference Time Training

Get an email whenever Cobus Greyling publishes.

Get an email whenever Cobus Greyling publishes. By signing up, you will create a Medium account if you don’t already…

LLM Hallucination Correction Via Training-Time Correction, Generation-Time Correction &…

These methods are not mutually exclusive, and can be implemented in parallel for highly scaleable enterprise…

Large Language Model (LLM) Disruption of Chatbots

To understand the disruption and demands Large Language Models will place on the Conversational AI/UI ecosystem going…

Updated: Emerging RAG & Prompt Engineering Architectures for LLMs

Large Language Models (LLMs) depend on unstructured data for input and output data is also unstructured and…

OpenAI GPT-3.5 Turbo Model Fine-Tuning

This article considers the process, speed and data requirements to create a fine-tuned model. On 22 August 2023 OpenAI…

12 Prompt Engineering Techniques

Prompt Engineering can be described as an art form, creating input requests for Large Language Models (LLMs) that will…

Steps In Evaluating Retrieval Augmented Generation (RAG) Pipelines

The basic principle of RAG is to leverage external data sources. For each user query or question, a contextual chunk of…

Mitigating Hallucination

Search engines & applications are attached to data sources. Even-though Large Language Models (LLMs) hold vast amounts…

Sign up to discover human stories that deepen your understanding of the world.

Free

Membership

Written by Cobus Greyling

No responses yet