What Does The OpenAI 16K Context Window Mean?

OpenAI launched a new model called gpt-3.5-turbo-16k, which means you can submit roughly 16,000 tokens (about 12,000 words of text) to OpenAI in one go.

Cobus Greyling
4 min readJun 15, 2023


I’m currently the Chief Evangelist @ HumanFirst. I explore and write about all things at the intersection of AI and language; ranging from LLMs, Chatbots, Voicebots, Development Frameworks, Data-Centric latent spaces and more.

In the image below is an extract of a 12,139-word, 14-page document which I submitted to the new OpenAI model, with the result at the bottom:

I found it intriguing that the GPT-3.5-Turbo-16k model is a chat-only model and is not supported by the Completions Endpoint. The following is the error message I received:

InvalidRequestError: This is a chat model and not supported in the v1/completions endpoint. Did you mean to use v1/chat/completions?

Given this, I had to choose the chat endpoint and the ChatML notation for input.

As an example, below is a working Python application accessing the 16k model:

import os
import openai

# Read the API key from the environment rather than hard-coding it
openai.api_key = os.environ["OPENAI_API_KEY"]

completion = openai.ChatCompletion.create(
    model="gpt-3.5-turbo-16k",
    messages=[
        {"role": "system", "content": "You are a chatbot which can search text and provide a summarised answer."},
        {"role": "user", "content": "How are you?"},
        {"role": "assistant", "content": "I am doing well"},
        {"role": "user", "content": "What is the distance between New York and Montreal?"},
    ],
)
print(completion)
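Printing the whole response object is noisy; in practice you usually want the assistant's text and the token usage. The helper below is a sketch of pulling those fields out of the response; the sample dict mirrors the shape of a chat completion response, but its values are illustrative, not real API output.

```python
# Hedged sketch: extract the reply text and total token count from a
# chat completion response. `sample` imitates the response structure;
# the content and token numbers are made up for illustration.
def extract_reply(completion: dict) -> tuple[str, int]:
    """Return (assistant_text, total_tokens) from a chat completion."""
    text = completion["choices"][0]["message"]["content"]
    total = completion["usage"]["total_tokens"]
    return text, total

sample = {
    "choices": [{"message": {"role": "assistant", "content": "About 530 km."}}],
    "usage": {"prompt_tokens": 58, "completion_tokens": 6, "total_tokens": 64},
}

text, total = extract_reply(sample)
print(text, total)  # About 530 km. 64
```

Watching the usage field is worthwhile with a 16k window, since longer prompts cost proportionally more.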

When I attempted to submit a document which exceeded the maximum context window, I received an informative error message detailing what was possible from a context window perspective:

InvalidRequestError: This model’s maximum context length is 16385 tokens. However, your messages resulted in 18108 tokens. Please reduce the length of the messages.
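Rather than waiting for this error, you can estimate the prompt size before calling the API. The sketch below uses a rough rule of thumb (English text averages around 4 characters per token); for exact counts you would use OpenAI's tiktoken library instead. The `reserve_for_reply` budget is my own assumption, not part of the API.

```python
# Rough pre-flight check before calling the 16k model. English text runs
# at roughly 4 characters per token, so len(text) // 4 is a workable
# estimate; use tiktoken when you need exact counts.
MAX_CONTEXT = 16385  # model limit reported in the error message

def estimate_tokens(text: str) -> int:
    return len(text) // 4

def fits_context(text: str, reserve_for_reply: int = 1000) -> bool:
    """True if the prompt plus a reply budget should fit the window."""
    return estimate_tokens(text) + reserve_for_reply <= MAX_CONTEXT

doc = "word " * 12139  # roughly the size of the document above
print(estimate_tokens(doc), fits_context(doc))
```

A document that fails this check needs to be shortened or split before submission.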

16k context means the model can now support ~20 pages of text in a single request.

- OpenAI

The new model alleviates one of the long-standing limitations of LLMs: the small context window, which forces long documents to be pre-processed and chunked before submission.
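For context, this is the kind of pre-processing a smaller window demands: splitting a document into overlapping chunks so each piece fits. The function below is a minimal sketch of that pattern; the chunk and overlap sizes are illustrative choices, not values prescribed by OpenAI.

```python
# Minimal sketch of context-window pre-processing: split a long document
# into overlapping word chunks so each piece fits a smaller window.
# chunk_size and overlap are illustrative, not official values.
def chunk_words(text: str, chunk_size: int = 3000, overlap: int = 200) -> list[str]:
    words = text.split()
    chunks, start = [], 0
    while start < len(words):
        chunks.append(" ".join(words[start:start + chunk_size]))
        start += chunk_size - overlap  # step back by `overlap` words
    return chunks

doc = " ".join(str(i) for i in range(7000))
pieces = chunk_words(doc)
print(len(pieces))  # 7000 words -> 3 chunks of at most 3000 words
```

With a 16k window, a 12,000-word document no longer needs this step at all; it still applies to anything larger.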

The amount of context the model can handle in a single request is remarkable, and in my testing both the response speed and the accuracy held up well.

The model returned a 256-word response, a one-sentence summary followed by 12 key points, which clearly captures the significance of the document.

⭐️ Please follow me on LinkedIn for updates on Conversational AI ⭐️


