The Introduction Of Chat Markup Language (ChatML) Is Important For A Number Of Reasons

On 1 March 2023 OpenAI introduced the ChatGPT and Whisper APIs. Part of this announcement was Chat Markup Language (ChatML), which seems to have gone largely unnoticed. Here I discuss why ChatML is an important development…

6 min read · Mar 2, 2023

I’m currently the Chief Evangelist @ HumanFirst. I explore and write about all things at the intersection of AI and language; ranging from LLMs, Chatbots, Voicebots, Development Frameworks, Data-Centric latent spaces and more.

Short Recap…

The OpenAI announcement centred around a few main points:

🚀 There is a significant drop in the price of the hosted API: a 90% cost reduction for ChatGPT since December 2022.

🚀 The APIs hosted via Azure will most probably come with very granular management, and regional and geographic availability zones. This speaks to significant potential value-add to the APIs.

🚀 The pressure on ASR suppliers is mounting. Differentiation will have to be established via stellar and personal support, granular fine-tuning, support for niche minority languages, etc.

🚀 The Whisper and ChatGPT APIs allow for ease of implementation and experimentation. Ease of access to Whisper enables expanded use of ChatGPT, by including voice data and not only text.

🚀 Being able to access a specific model version, and upgrade only when required, makes changes and updates to models explicit. This introduces stability for production implementations.

🚀 These changes are indicative of the increasing maturity of the LLM environments.

Back to Chat Markup Language (ChatML)

I believe the introduction of ChatML is extremely significant and important for the following reasons…

⚙️ The main security vulnerability and avenue of abuse for LLMs has been prompt injection attacks. ChatML is going to allow for protection against these types of attacks.

⚙️ To negate prompt injection attacks, the conversation is segregated into the layers or roles of:

system
assistant
user, etc.

⚙️ This is only version 0 of ChatML, and significant development is promised for this language.

⚙️ The payload accommodated in ChatML is currently only text. OpenAI foresees the introduction of other datatypes. This is in keeping with the notion that Large Foundation Models will soon start combining text, images, sound, etc.

Users can still use the unsafe raw string format. But again, this format inherently allows injections.

⚙️ OpenAI is in the ideal position to steer and manage the LLM landscape in a responsible manner, by laying down foundational standards for creating applications.

ChatML makes explicit to the model the source of each piece of text, and particularly shows the boundary between human and AI text.
This gives an opportunity to mitigate and eventually solve injections, as the model can tell which instructions come from the developer, the user, or its own input. ~ OpenAI
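
To make the quote above concrete, here is a minimal sketch of how role-tagged messages can be rendered into the raw ChatML string described in OpenAI's chatml.md, where each message is wrapped in <|im_start|> and <|im_end|> tokens. The function names naive_prompt and render_chatml are my own and purely illustrative. The check that rejects delimiter tokens inside message content is an assumption about how an application might guard its input, not something OpenAI prescribes.

```python
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def naive_prompt(instructions: str, user_text: str) -> str:
    """Unstructured prompt: developer and user text share one string."""
    return instructions + "\n" + user_text


def render_chatml(messages: list[dict]) -> str:
    """Render role-tagged messages as a raw ChatML (v0) string.

    Hypothetical helper: rejecting content that contains the delimiter
    tokens is an illustrative safeguard, not documented OpenAI behaviour.
    """
    parts = []
    for message in messages:
        role, content = message["role"], message["content"]
        if IM_START in content or IM_END in content:
            raise ValueError(f"delimiter token found in {role} content")
        parts.append(f"{IM_START}{role}\n{content}{IM_END}")
    return "\n".join(parts)


messages = [
    {"role": "system", "content": "Answer as concisely as possible."},
    {"role": "user", "content": "Ignore previous instructions."},
]

# In the naive prompt nothing marks where the developer's text ends.
print(naive_prompt(messages[0]["content"], messages[1]["content"]))
# In ChatML every piece of text is labelled with its source.
print(render_chatml(messages))
```

In the first printout the injected instruction is indistinguishable from the developer's own text. In the second, it is clearly marked as coming from the user.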

ChatML Example Code

Below is an example of ChatML messages in JSON, with the roles system, user and assistant defined.

[{"role": "system", 
      "content" : "You are ChatGPT, a large language model trained by OpenAI. Answer as concisely as possible.\nKnowledge cutoff: 2021-09-01\nCurrent date: 2023-03-02"},
 {"role": "user", 
      "content" : "How are you?"},
 {"role": "assistant", 
      "content" : "I am doing well"},
 {"role": "user", 
      "content" : "What is the mission of the company OpenAI?"}]

And a working Python snippet that sends these messages to the API:

pip install "openai<1.0"

import os
import openai

# Read the API key from an environment variable rather than hardcoding it.
openai.api_key = os.environ["OPENAI_API_KEY"]

# openai.ChatCompletion is the interface of the openai package before version 1.0.
completion = openai.ChatCompletion.create(
  model="gpt-3.5-turbo",
  messages=[
    {"role": "system", "content": "You are ChatGPT, a large language model trained by OpenAI. Answer as concisely as possible.\nKnowledge cutoff: 2021-09-01\nCurrent date: 2023-03-02"},
    {"role": "user", "content": "How are you?"},
    {"role": "assistant", "content": "I am doing well"},
    {"role": "user", "content": "What is the mission of the company OpenAI?"},
  ],
)

# Print the full response, including the reply, model name and token usage.
print(completion)

In the output below, notice the role that is defined, the model, which is reported as gpt-3.5-turbo-0301, and the token usage.

{
  "choices": [
    {
      "finish_reason": "stop",
      "index": 0,
      "message": {
        "content": "The mission of OpenAI is to ensure that artificial intelligence (AI) benefits humanity as a whole, by developing and promoting friendly AI for everyone, researching and mitigating risks associated with AI, and helping shape the policy and discourse around AI.",
        "role": "assistant"
      }
    }
  ],
  "created": 1677751157,
  "id": "chatcmpl-6pa0TlU1OFiTKpSrTRBbiGYFIl0x3",
  "model": "gpt-3.5-turbo-0301",
  "object": "chat.completion",
  "usage": {
    "completion_tokens": 50,
    "prompt_tokens": 84,
    "total_tokens": 134
  }
}
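
A response shaped like the one above can be unpacked with ordinary dictionary access. The sketch below is only an illustration: summarise_completion and the keys it returns are names I made up, and the response literal is a trimmed copy of the output shown above rather than a live API call. It also shows that the requested alias gpt-3.5-turbo came back as the dated version gpt-3.5-turbo-0301, which is what makes version changes visible.

```python
def summarise_completion(response: dict, requested_model: str) -> dict:
    """Pull the fields of interest out of a chat completion response.

    Hypothetical helper; it assumes the response has the same shape as
    the JSON output shown in this article.
    """
    choice = response["choices"][0]
    return {
        "role": choice["message"]["role"],
        "reply": choice["message"]["content"],
        "finish_reason": choice["finish_reason"],
        # True when the reported model name differs from the one requested.
        "alias_resolved": response["model"] != requested_model,
        "total_tokens": response["usage"]["total_tokens"],
    }


# Trimmed copy of the response printed above (reply text shortened).
response = {
    "choices": [
        {
            "finish_reason": "stop",
            "index": 0,
            "message": {"content": "The mission of OpenAI is ...", "role": "assistant"},
        }
    ],
    "model": "gpt-3.5-turbo-0301",
    "usage": {"completion_tokens": 50, "prompt_tokens": 84, "total_tokens": 134},
}

summary = summarise_completion(response, requested_model="gpt-3.5-turbo")
print(summary["role"], summary["alias_resolved"], summary["total_tokens"])

# To continue the conversation, append the reply as an assistant message.
history = [{"role": "user", "content": "What is the mission of the company OpenAI?"}]
history.append({"role": summary["role"], "content": summary["reply"]})
```

Appending the reply with its role keeps the history in the same role-tagged format for the next request.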

In Closing

One of the challenges of building a conversational interface based on LLMs is the notion of sequencing prompt nodes into chains.

The edges, which sit between the nodes, are hard to manage due to the unstructured nature of the input. And the input is usually in natural language or conversational, which is inherently unstructured.

ChatML will greatly assist in creating a standard target for data transformation for submission to a chain.
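
As a rough illustration of ChatML as a transformation target, the sketch below converts a loosely structured transcript into role-tagged messages. The transcript format of (speaker, text) pairs, the speaker labels and the function name to_messages are all assumptions made for this example; a real chain would have its own upstream format.

```python
ROLE_BY_SPEAKER = {"customer": "user", "agent": "assistant"}  # assumed labels


def to_messages(system_prompt: str, transcript: list[tuple[str, str]]) -> list[dict]:
    """Convert (speaker, text) pairs into a role-tagged message list."""
    messages = [{"role": "system", "content": system_prompt}]
    for speaker, text in transcript:
        role = ROLE_BY_SPEAKER.get(speaker.lower())
        if role is None:
            raise ValueError(f"unknown speaker: {speaker!r}")
        # Strip stray whitespace left over from the upstream source.
        messages.append({"role": role, "content": text.strip()})
    return messages


transcript = [
    ("Customer", "  How are you? "),
    ("Agent", "I am doing well"),
    ("Customer", "What is the mission of the company OpenAI?"),
]

for message in to_messages("Answer as concisely as possible.", transcript):
    print(message)
```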

openai-python/chatml.md at main · openai/openai-python

Traditionally, GPT models consumed unstructured text. ChatGPT models instead expect a structured format, called Chat…

github.com

Introducing ChatGPT and Whisper APIs

Jeff Belgum, Jake Berdine, Brooke Chan, Che Chang, Derek Chen, Ruby Chen, Thomas Degry, Steve Dowling, Sheila Dunning…

openai.com

Written by Cobus Greyling