New OpenAI GPT-4 service will help spot errors in ChatGPT coding suggestions

In a bid to increase the usefulness of generative AI tools to developers, OpenAI has introduced CriticGPT, a new model it says can help identify errors in ChatGPT code outputs.

OpenAI claims that reviewers assisted by CriticGPT, which is based on GPT-4, outperform unaided reviewers 60% of the time. The company presents this as evidence that the model enhances human performance in code review tasks rather than replacing human workers.

OpenAI’s initiative aims to refine the ‘Reinforcement Learning from Human Feedback’ (RLHF) process in order to ensure higher quality and greater reliability in AI systems.

OpenAI launches new code-checking model

OpenAI’s latest GPT-4 series, which powers publicly available versions of ChatGPT, relies heavily on RLHF to ensure that its outputs are both reliable and interactive. Until now, this process has been a manual one, relying on human AI trainers who rate ChatGPT responses to improve the model’s performance.

With the launch of CriticGPT, OpenAI can now generate critiques of ChatGPT’s answers automatically, which addresses concerns that the chatbot’s outputs are becoming too sophisticated for many human trainers to evaluate.

To train CriticGPT, AI trainers deliberately inserted mistakes into ChatGPT-generated code and then wrote feedback on them. The results were promising: trainers preferred CriticGPT’s critiques around two-thirds (63%) of the time, thanks to the tool producing fewer nitpicks and hallucinations.
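As a rough illustration of how a preference figure like this can be calculated, the Python sketch below tallies pairwise trainer judgments. The `Comparison` record, the `preference_rate` helper, and the sample votes are all hypothetical and made up for this example; they are not OpenAI’s data or evaluation code.

```python
from dataclasses import dataclass


@dataclass
class Comparison:
    """One trainer judgment between two critiques of the same code sample."""
    sample_id: str
    preferred: str  # hypothetical labels: "critic" or "baseline"


def preference_rate(comparisons, model="critic"):
    """Return the fraction of comparisons in which `model` was preferred."""
    if not comparisons:
        raise ValueError("at least one comparison is required")
    wins = sum(1 for c in comparisons if c.preferred == model)
    return wins / len(comparisons)


# Made-up votes for illustration only (5 of 8 favour the critic model).
votes = [
    Comparison("s1", "critic"),
    Comparison("s2", "baseline"),
    Comparison("s3", "critic"),
    Comparison("s4", "critic"),
    Comparison("s5", "baseline"),
    Comparison("s6", "critic"),
    Comparison("s7", "baseline"),
    Comparison("s8", "critic"),
]

print(f"Critic preferred in {preference_rate(votes):.1%} of comparisons")
```

A real evaluation would involve far more samples and would also need to account for disagreement between trainers, which this sketch ignores.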

However, the project isn’t without its limitations, and AI-human collaboration continues to prove more effective than AI alone.

In its announcement, OpenAI summarized: “CriticGPT’s suggestions are not always correct, but we find that they can help trainers to catch many more problems with model-written answers than they would without AI help.”

The company also acknowledged that “mistakes can be spread across many parts of an answer,” which makes it more complex for an AI tool to identify the cause.

Looking ahead, OpenAI has confirmed plans to scale its work on CriticGPT and to put it into practice.

Craig Hale

With several years’ experience freelancing in tech and automotive circles, Craig’s specific interests lie in technology that is designed to better our lives, including AI and ML, productivity aids, and smart fitness. He is also passionate about cars and the decarbonisation of personal transportation. He is an avid bargain-hunter, so you can be sure that any deal Craig finds is top value!
