Artificial intelligence can now complete your kid's mathematics homework

mathematics
(Image credit: Shutterstock / Halfpoint)

Researchers have successfully developed an AI system capable of completing mathematics problems at a grade school level, a new report asserts.

Traditionally, while AI models are proficient at manipulating language to formulate sentences, the multi-step reasoning required to solve math problems has been a step too far.  

However, researchers at OpenAI (the company behind language model GPT-3) say they have trained a model to recognize its own mistakes, which means it can repeatedly reassess until it discovers a workable solution.

In testing, the AI system was able to solve almost as many problems as a sample of children between the ages of nine and twelve. The children scored 60% on a test drawn down from the OpenAI database, while the AI system scored 55%.

AI takes on mathematics

Although grade school mathematics problems are simple enough for most people to complete with ease, OpenAI says the arrival of AI models capable of solving even basic math challenges is a major step forward and will unlock a number of opportunities.

“One significant challenge in mathematical reasoning is the high sensitivity to individual mistakes,” explained the researchers. “Autoregressive models, which generate each solution token by token, have no mechanism to correct their errors. Solutions that veer off-course quickly become unrecoverable.” 

OpenAI worked around this problem by training a set of verifiers, the role of which was to evaluate the answers produced by the AI model. These verifiers were given 100 potential solutions, all generated by the model, and were then tasked with determining whether any were correct.

“Providing correct arguments and recognizing incorrect ones are key challenges in developing more general AI,” OpenAI added. “[Grade school] problems are conceptually simple, yet one subtle mistake is enough to derail an entire solution. Identifying and avoiding such mistakes is a crucial skill for our models to develop.”

The company believes the verification system that allows its AI systems to solve simple math problems with relative accuracy will become increasingly important as AI is applied to more complex domains.

In combination with advances in the field of semiconductors, which will make possible AI models that are many times larger (and therefore more capable) than they are today, the ability to tweak the way in which AI approaches a problem could prove transformative.

Also check out our lists of the best cloud hosting services, best bare metal hosting and best dedicated server hosting.

TOPICS
Joel Khalili
News and Features Editor

Joel Khalili is the News and Features Editor at TechRadar Pro, covering cybersecurity, data privacy, cloud, AI, blockchain, internet infrastructure, 5G, data storage and computing. He's responsible for curating our news content, as well as commissioning and producing features on the technologies that are transforming the way the world does business.

Read more
Humanity's Last Exam
OpenAI's Deep Research smashes records for the world's hardest AI exam, with ChatGPT o3-mini and DeepSeek left in its wake
AI
ChatGPT is the homework helper for more than a quarter of teens – and the trend is accelerating
Humanity's Last Exam
Could you pass 'Humanity’s Last Exam'? Probably not, but neither can AI
StudyFetch
Your new favorite teacher might be this AI educator that never loses their patience
AI Learning for kids
AI doesn't belong in the classroom unless you want kids to learn all the wrong lessons
ChatGPT
ChatGPT wants to write your next novel, and readers and writers alike should be very worried
Latest in Pro
Zendesk Relate 2025
Zendesk Relate 2025 - everything you need to know as the event unfolds
Microsoft
"Another pair of eyes" - Microsoft launches all-new Security Copilot Agents to give security teams the upper hand
Lock on Laptop Screen
Medusa ransomware is able to disable anti-malware tools, so be on your guard
AI quantization
What is AI quantization?
US flags
US government IT contracts set to be centralized in new Trump order
An abstract image of digital security.
Fake file converters are stealing info, pushing ransomware, FBI warns
Latest in News
Zendesk Relate 2025
Zendesk Relate 2025 - everything you need to know as the event unfolds
Disney Plus logo with popcorn
You can finally tell Disney+ to stop bugging you about that terrible Marvel show you regret starting
Google Gemini AI
Gemini can now see your screen and judge your tabs
Girl wearing Meta Quest 3 headset interacting with a jungle playset
Latest Meta Quest 3 software beta teases a major design overhaul and VR screen sharing – and I need these updates now
Philips Hue
Philips Hue might be working on a video doorbell, and according to a new report, we just got our first look at it
Microsoft
"Another pair of eyes" - Microsoft launches all-new Security Copilot Agents to give security teams the upper hand