Goodbye GPT-3.5, OpenAI's new GPT-4o mini AI model is all about compact power

OpenAI logo
(Image credit: OpenAI)

OpenAI has added a new large language model (LLM) called GPT-4o mini to ChatGPT and its APIs. As the name implies, the GPT-4o Mini model is a smaller version of the GPT-4o model introduced in May. The mini model is designed to balance the power of GPT-4o with a more cost-efficient approach.

GPT-4o mini has much of the functionality of its larger cousin, though the API only has text and vision support for now, with image, video, and audio inputs and outputs still in the works. Like GPT-4o, the new model has a context window of 128,000 tokens, or eight times that of GPT-3.5 Turbo. The new model also comes with enhanced safety features. Along with those built into GPT-4o already, GPT-4o mini added new techniques that make it more resistant to jailbreaks and improper prompt injections, among other issues concerning developers looking to deploy AI APIs broadly.

Ready for bigger jobs

OpenAI suggests the bigger context window and other upgrades, such as improved non-English text understanding, will make GPT-4o mini especially useful for processing big documents or linking multiple interactions with the AI model. For example, it could provide better recommendations in online stores, speed up real-time text responses for customer service, and produce accurate and detailed answers to students studying for an exam more quickly than other models. OpenAI has visions of GPT-4o automating and streamlining business processes thanks to its ability to fetch data and take actions with external systems. For businesses using the API, the cost is notably reduced to just over half the price per token of GPT-3.5 Turbo.

"OpenAI is committed to making intelligence as broadly accessible as possible," OpenAI explained in its announcement. "We expect GPT-4o mini will significantly expand the range of applications built with AI by making intelligence much more affordable."

GPT-4o mini is part of the recent wave of smaller LLMs like Google's Gemini Flash and Anthropic's Claude Haiku. According to OpenAI, however, GPT-4o mini blows them out of the water when it comes to many of the standard tests. The model scored 82% on the Massive Multitask Language Understanding (MMLU) benchmark, compared to 77.9% and 73.8% by Gemini Flash and Haiku, respectively. The same goes for the MGSM and Human Eval tests, where GPT-4o Mini hit 87% and 87.2%, while Gemini Flash had 75.5% and 71.5%, and Haiku had 71.7% and 75.9%. In other words, GPT-4o Mini wins out on textual comprehension in addition to math and coding tasks, as can be seen in the graph below. 

GPT-4o Mini Eval

(Image credit: OpenAI)

Mini Model Major Plans

The introduction of GPT-4o Mini represents a significant step in making advanced AI more affordable and accessible, according to OpenAI. Lower costs plus better performance will likely help incorporate AI into everyday applications. The same goes for ChatGPT users, who can all access the model starting this week. OpenAI also has plans to introduce fine-tuning capabilities for GPT-4o Mini within the API.

The broader picture shows another step in ChatGPT's evolving services. As OpenAI phases out GPT-3.5 for ChatGPT, the focus shifts to the next stage of providing more powerful models. OpenAI CEO Sam Altman has long hinted at how GPT-5 will "substantially improve" upon the existing models. At the same time, the leaked OpenAI scale for measuring AI power shows there is still a long way to go to the still-mythical artificial general intelligence (AGI) that can perfectly mimic the workings of the human mind. 

You might also like...

TOPICS
Eric Hal Schwartz
Contributor

Eric Hal Schwartz is a freelance writer for TechRadar with more than 15 years of experience covering the intersection of the world and technology. For the last five years, he served as head writer for Voicebot.ai and was on the leading edge of reporting on generative AI and large language models. He's since become an expert on the products of generative AI models, such as OpenAI’s ChatGPT, Anthropic’s Claude, Google Gemini, and every other synthetic media tool. His experience runs the gamut of media, including print, digital, broadcast, and live events. Now, he's continuing to tell the stories people want and need to hear about the rapidly evolving AI space and its impact on their lives. Eric is based in New York City.

Read more
ChatGPT o1
OpenAI starts the release of its o3 mini AI model for ChatGPT, and it's got a nice speed boost over o1
OpenAI o3-mini
OpenAI responds to the DeepSeek buzz by launching its latest o3-mini reasoning model for all users
An iPhone showing the ChatGPT logo on its screen
ChatGPT-4.5 is here for Pro users now and Plus users next week, and I can't wait to try it
ChatGPT logo
ChatGPT explained – everything you need to know about the AI chatbot
GPT 4.5
ChatGPT 4.5 understands subtext, but it doesn't feel like an enormous leap from ChatGPT-4o
Open AI
OpenAI is finally going to make ChatGPT a lot less confusing – and hints at a GPT-5 release window
Latest in Artificial Intelligence
A super close up image of the Google Gemini app in the Play Store
It's official: Google Assistant will be retired for phones this year, with Gemini taking over
Super Mario Odyssey
ChatGPT is the ultimate gaming tool - here's 4 ways you can use AI to help with your next playthrough
Apple CEO Tim Cook delivers remarks before the start of an Apple event at Apple headquarters on September 09, 2024 in Cupertino, California. Apple held an event to showcase the new iPhone 16, Airpods and Apple Watch models. (Photo by Justin Sullivan/Getty Images)
The big Siri Apple Intelligence delay proves that maybe we really don't know Apple at all
AI writer
Coding AI tells developer to write it himself
Apple iPhone 16 Pro Max REVIEW
Apple Intelligence is a fever dream that I bet Apple wishes we could all forget about
DeepSeek on an iPhone
OpenAI calls on US government to ban DeepSeek, calling it ‘state-subsidized’ and ‘state-controlled’
Latest in News
Google Pixel 8a in aloe green showing
Google Pixel 9a benchmark link teases the performance of the upcoming mid-ranger
Quordle on a smartphone held in a hand
Quordle hints and answers for Monday, March 17 (game #1148)
NYT Strands homescreen on a mobile phone screen, on a light blue background
NYT Strands hints and answers for Monday, March 17 (game #379)
NYT Connections homescreen on a phone, on a purple background
NYT Connections hints and answers for Monday, March 17 (game #645)
Apple iPhone 16 Pro HANDS ON
Leaked iPhone 17 dummy units may have given us our best look yet at all four models
A super close up image of the Google Gemini app in the Play Store
It's official: Google Assistant will be retired for phones this year, with Gemini taking over