The maker of the world’s largest chip has made a major AI breakthrough

Cerebras WSE-2
(Image credit: Cerebras)

Cerebras Systems, maker of the world’s largest processor, has broken the record for the most complex AI model trained using a single device.

Using one CS-2 system, powered by the company’s wafer-sized chip (WSE-2), Cerebras is now able to train AI models with up to 20 billion parameters thanks to new optimizations at the software level.

The firm says the breakthrough will resolve one of the most frustrating problems for AI engineers: the need to partition large-scale models across thousands of GPUs. The result is an opportunity to drastically cut the time it takes to develop and train new models.

Cerebras brings AI to the masses

In sub-disciplines such as natural language processing (NLP), model performance correlates linearly with the number of parameters. In other words, the larger the model, the better the end result.

Developing large-scale AI products traditionally involves spreading a model across a large number of GPUs or accelerators, either because there are too many parameters to fit in memory or because compute performance is insufficient to handle training workloads.

“This process is painful, often taking months to complete,” explained Cerebras. “To make matters worse, the process is unique to each network compute cluster pair, so the work is not portable to different compute clusters, or across neural networks. It is entirely bespoke.”
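
For a rough sense of why a model of this size is usually split across devices, the back-of-the-envelope sketch below estimates the memory needed for 20 billion parameters. The figures are assumptions for illustration, not numbers from Cerebras: 2 bytes per parameter for 16-bit weights, roughly 16 bytes per parameter for mixed-precision training with an Adam-style optimizer (a common rule of thumb), and a hypothetical accelerator with 80 GB of memory. Activations are ignored, so real requirements would be higher.

```python
import math


def memory_gb(num_params: int, bytes_per_param: int) -> float:
    """Return the memory footprint in GB (10**9 bytes) for a per-parameter byte budget."""
    return num_params * bytes_per_param / 1e9


# 20 billion parameters, the model size cited in the article.
NUM_PARAMS = 20_000_000_000

# Assumption: 16-bit weights only, 2 bytes per parameter.
weights_only_gb = memory_gb(NUM_PARAMS, 2)

# Assumption: ~16 bytes per parameter for weights, gradients and
# optimizer state in mixed-precision training (rule of thumb).
training_state_gb = memory_gb(NUM_PARAMS, 16)

# Assumption: a hypothetical accelerator with 80 GB of on-device memory.
DEVICE_MEMORY_GB = 80
min_devices = math.ceil(training_state_gb / DEVICE_MEMORY_GB)

print(f"Weights only:   {weights_only_gb:.0f} GB")
print(f"Training state: {training_state_gb:.0f} GB")
print(f"Minimum devices at {DEVICE_MEMORY_GB} GB each: {min_devices}")
```

Under these assumptions, the training state alone exceeds the memory of a single such device several times over, which is the kind of pressure that forces the partitioning work described above.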

Although the most complex models consist of many more than 20 billion parameters, the ability to train relatively large-scale AI models on a single CS-2 device eliminates these bottlenecks for many, accelerating development for existing players and democratizing access for those previously unable to participate in the space.

“Cerebras’ ability to bring large language models to the masses with cost-efficient, easy access opens up an exciting new era in AI. It gives organizations that can’t spend tens of millions an easy and inexpensive on-ramp to major league NLP,” said Dan Olds, Chief Research Officer, Intersect360 Research.

“It will be interesting to see the new applications and discoveries CS-2 customers make as they train GPT-3 and GPT-J class models on massive datasets.”

What’s more, Cerebras hinted that its CS-2 system may be able to handle even larger models in future, with “even trillions of parameters”. Chaining together multiple CS-2 systems, meanwhile, could pave the way for AI networks larger than the human brain.

Joel Khalili
News and Features Editor

Joel Khalili is the News and Features Editor at TechRadar Pro, covering cybersecurity, data privacy, cloud, AI, blockchain, internet infrastructure, 5G, data storage and computing. He's responsible for curating our news content, as well as commissioning and producing features on the technologies that are transforming the way the world does business.
