OpenAI has an AI text detector but doesn't want to release it


OpenAI has developed some new tools to detect content generated by ChatGPT and its other AI models, but it isn't going to deploy them just yet. The company has come up with a way to overlay AI-produced text with a kind of watermark. This embedded indicator could reveal when AI has written a piece of content. However, OpenAI is hesitant to offer this as a feature because it might harm those using its models for benign purposes.

OpenAI's new method would employ algorithms capable of embedding subtle markers in text generated by ChatGPT. Though invisible to the naked eye, the markers would take the form of specific patterns of words and phrases that signal the text's origin from ChatGPT. There are obvious reasons this might be a boon for the generative AI industry, as OpenAI points out. Watermarking could play a critical role in combating misinformation, ensuring transparency in content creation, and preserving the integrity of digital communications. It's also similar to a tactic already employed by OpenAI for its AI-generated images. The DALL-E 3 text-to-image model produces visuals with metadata explaining their AI origin, including invisible digital watermarks that can even survive attempts to remove them through editing.
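OpenAI has not published how its text watermark works, so the following is only a toy illustration of the general idea of hiding a statistical pattern in word choices. It borrows a "green list" scheme of the kind described in public research, not OpenAI's method. The secret key, the made-up vocabulary, the random stand-in for a language model, and every function name here are assumptions for the sake of the sketch.

```python
import hashlib
import math
import random

GREEN_FRACTION = 0.5  # assumed share of words treated as "green" at each step


def is_green(prev_word: str, word: str, key: str = "demo-key") -> bool:
    """Pseudo-randomly mark `word` as green, seeded by the previous word and a secret key."""
    digest = hashlib.sha256(f"{key}|{prev_word}|{word}".encode()).digest()
    return digest[0] / 256 < GREEN_FRACTION


def generate(vocab, length, watermark, rng):
    """Stand-in for a language model: picks random words, preferring green ones when watermarking."""
    words = ["<start>"]
    for _ in range(length):
        candidates = rng.sample(vocab, 8)
        if watermark:
            green = [w for w in candidates if is_green(words[-1], w)]
            candidates = green or candidates  # fall back if no candidate is green
        words.append(rng.choice(candidates))
    return words[1:]


def z_score(words):
    """Standard deviations by which the green count exceeds what unmarked text would show."""
    pairs = list(zip(["<start>"] + words[:-1], words))
    hits = sum(is_green(prev, word) for prev, word in pairs)
    n = len(pairs)
    expected = GREEN_FRACTION * n
    std = math.sqrt(n * GREEN_FRACTION * (1 - GREEN_FRACTION))
    return (hits - expected) / std


rng = random.Random(0)
vocab = [f"word{i}" for i in range(500)]  # hypothetical vocabulary
marked = generate(vocab, 200, watermark=True, rng=rng)
plain = generate(vocab, 200, watermark=False, rng=rng)
print(f"watermarked z-score: {z_score(marked):.1f}")
print(f"unmarked z-score:    {z_score(plain):.1f}")
```

A reader sees nothing unusual in either word sequence, but anyone holding the key can count how often the "green" words turn up. The watermarked text scores far above chance, while ordinary text hovers near zero. The signal depends on the exact words and their order, which is why replacing them can weaken or erase it.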

But words are not the same as images. Even in the best circumstances, OpenAI admitted all it would take is a third-party tool to rephrase the AI-generated text and effectively make the watermark disappear. And, while OpenAI's new approach might work in many cases, the company didn't shy away from highlighting its limits, or from explaining why even a successful watermark might not always be desirable.

"While it has been highly accurate and even effective against localized tampering, such as paraphrasing, it is less robust against globalized tampering; like using translation systems, rewording with another generative model, or asking the model to insert a special character in between every word and then deleting that character - making it trivial to circumvention by bad actors," OpenAI explained in a blog post. "Another important risk we are weighing is that our research suggests the text watermarking method has the potential to disproportionately impact some groups."

AI Authorship Stamp

OpenAI is worried that the negative consequences of releasing this kind of AI watermarking will outweigh any positive impact. The company specifically cited those who use ChatGPT for productivity tasks, but watermarking could even lead to direct stigmatization or criticism of users who rely on generative AI tools, regardless of who they are and how they use them.

This might disproportionately affect non-English-speaking users of ChatGPT, who employ translations and make content in a different language. The presence of watermarks might create barriers for these users, reducing the effectiveness and acceptance of AI-generated content in multilingual contexts. The resulting backlash could lead users to abandon the tool if they know their content can be easily identified as AI-generated.

Notably, this isn't OpenAI's first foray into AI text detection. However, the company ended up shutting the earlier detector down after just six months and later said such tools are ineffective in general, explaining why there isn't such an option in a teacher's guide for using ChatGPT. Still, the update suggests the search for a perfect way of spotting AI text without causing problems that drive people away from AI text generators is far from over.

Eric Hal Schwartz
Contributor

Eric Hal Schwartz is a freelance writer for TechRadar with more than 15 years of experience covering the intersection of the world and technology. For the last five years, he served as head writer for Voicebot.ai and was on the leading edge of reporting on generative AI and large language models. He's since become an expert on the products of generative AI models, such as OpenAI’s ChatGPT, Anthropic’s Claude, Google Gemini, and every other synthetic media tool. His experience runs the gamut of media, including print, digital, broadcast, and live events. Now, he's continuing to tell the stories people want and need to hear about the rapidly evolving AI space and its impact on their lives. Eric is based in New York City.
