Testing OpenAI’s New AI Text Classifier For Identifying AI-Written Content

I took human and AI generated text from various sources, including LLMs and submitted it to the OpenAI Classifier. The objective was to gauge the classifier’s ability to detect the origin of text content.

Text Generated Via The Cohere LLM

In the image below you see text generated in the Cohere playground…the engineered prompt is indicated by the red arrow. In other words, the instruction given to the LLM; the input.

Text generated in the Cohere Playground are submitted here to the AI Text Classifier of OpenAI.

Text Generated Via AI21Labs

The same generation command was issued in the AI21Labs playground…asking the AI21Labs LLM to generate text on the importance of punctuality.

Text generated in the AI21Labs Playground are submitted here to the AI Text Classifier of OpenAI.

ChatGPT

Below you see context generated by ChatGPT…and is rated as possibly by the classifier. Hence being seen one step closer to human generated text as apposed to Cohere and AI21Labs.

OpenAI text-davinci-003 Model

I also submitted a 500 word text generation by text-davinci-003 on the topic of punctuality and received the same answer from ChatGPT; Possibly AI-generated.

An Essay From The Web

I copied a piece from an online essay, and the result from the classifier is ambitious to some degree, but fairly accurate.

My Own Writing

Below is a original piece I wrote on the same subject, which was marked by OpenAI as possibly AI-generated. I would expect a result of Unclear if it is AI-generated.

Wikipedia

Considering that the AI Text Classifier was trained on Wikipedia, I copied a piece from Wikipedia on World War I and asked the classifier to vet the contents. Here I got the right answer, and also the highest ranking of very unlikely.

Can ChatGPT Detect Text Origins?

The short answer is…yes.

Keep In Mind

Apart from the accuracy issues stated at the beginning of this article, there are other limitations…

The Data

OpenAI collected a dataset of AI-generated and human-written text.

In Conclusion

It is evident that the accuracy of the classifier is not where it should be, and OpenAI states this fact openly: “Our classifier is not fully reliable”.

--

--

Chief Evangelist @ HumanFirst. I explore and write about all things at the intersection of AI and language; NLP/NLU/LLM, Chat/Voicebots, CCAI. www.humanfirst.ai

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Cobus Greyling

Chief Evangelist @ HumanFirst. I explore and write about all things at the intersection of AI and language; NLP/NLU/LLM, Chat/Voicebots, CCAI. www.humanfirst.ai