Forget Intel and AMD - Nvidia's next big competitor might be a company you've never heard of

Cerebras CS-3 AI Chip
(Image credit: Cerebras)

In recent years, Nvidia has found a huge amount of success with its pivot to AI, as large language models and GPU-accelerated "premium AI PC" experiences appear to be the hot new thing in 2024. However, newer, smaller companies are vying for its market share, and they are not the ones you may be expecting.  

As reported by The Economist, there have been developments in the GPU field outside of the best graphics cards made by Nvidia and AMD for AI computing. That's because some of today's large language models run across many setups featuring interconnected GPUs and memory, such as with Cerebras' hardware

Cerebras Systems Inc. was founded just nine years ago but seems to benefit massively from the recent AI computing boom. It's innovated in ways that appear to put the current-gen H100 and the upcoming GB200 die to shame with a "single, enormous chip" cable of up to 900,000 GPU cores - such as with its CS-3 chip. 

The Cerebras CS-3 chip absolutely dwarfs the double die size of the huge GB200, and is the size of a steering wheel, requiring two hands to hold. It's been described by the manufacturer as the "world's fastest and most scalable AI accelerator" which is purpose-built to "train the world's most advanced AI models". 

Cerebras CS-3 chart

(Image credit: Cerebras)

Furthermore, the American manufacturer has doubled down stating that its Wafer Scale Engine is "the chip that broke Moore's Law". As its internal benchmarks show, the CS-3 sits confidently above the H100 with a total of 10,000,000,000,000 transistors. For reference, the GB200 is set to feature 208,000,000,000, as the CS-3 features a staggering increase of 4,707%.

However, it's not just Cerebras that is making moves here, as new start-up company Groq is also developing hardware for AI computing, too. Instead of going larger than its competition, it has instead developed what it calls dedicated LPUs (language processing units) which are built to run large language models effectively and quickly. 

In the company's own words, the Groq LPU Inference Engine is an "end-to-end inference system acceleration system, to deliver substantial performance, efficency, and precision in a simple design". It's currently running Llama-2 70B, a large-scale generative language and text model, at 300 tokens per user by second. 

This feat is possible because of the LPU which resides in the data center in tandem with the CPU and GPU, and this makes low latency and real-time delivery a possibility. Think of it as having a more sophisticated NPU at the heart of a chip just fine-tuned to one specific purpose and on a much grander scale and you're right on the money. 

The profitability of the AI market means competition

Nvidia's financial success in recent years has been no secret as Team Green was even briefly more valuable than Amazon, even giving Alphabet (Google's parent company) a run for its money. With figures like that, it's not surprising that more manufacturers are throwing their hats into the ring and going for the jugular. 

Whether the likes of Cerebras and Groq, or even smaller companies such as MatX, have a chance here remains to be seen, however, as AI computing is still largely in its infancy, now is the time we'll be seeing the most experimentation with how the hardware can cater to the end user. Some will scale up, others will work smarter. 

You may also like...

TOPICS
Aleksha McLoughlin
Contributor

Formerly TechRadar Gaming's Hardware Editor, Aleksha McLoughlin is now a freelance writer and editor specializing in computing tech, video games, and E-commerce. As well as her many contributions to this site, you'll also find her work available on sister sites such as PC Gamer, GamesRadar, and Android Central. Additionally, more of her bylines can be found on Trusted Reviews, Dexerto, Expert Reviews, Techopedia, PC Guide, VideoGamer, and more.

Read more
Representation of AI
These are the 10 hottest AI hardware companies to follow in 2025
A person holding out their hand with a digital AI symbol.
AI smartphone and laptop sales are said to be slowly dying – but is anyone surprised?
Project DIGITS - front view
I am thrilled by Nvidia’s cute petaflop mini PC wonder, and it’s time for Jensen’s law: it takes 100 months to get equal AI performance for 1/25th of the cost
Cerebras WSE-3
DeepSeek on steroids: Cerebras embraces controversial Chinese ChatGPT rival and promises 57x faster inference speeds
Nvidia H800 GPU
A look at the unbelievable Nvidia GPU that powers DeepSeek's AI global ambition
Nvidia HQ
Nvidia calls DeepSeek an 'excellent AI advancement' and praises the Chinese AI app's ingenuity
Latest in GPU
AMD Radeon RX 6000 Series Graphics Card on top wooden desk beside a keyboard
How to update AMD GPU drivers
A character riding their horse through the Japanese landscape of in Rise of the Ronin
Another day, another dreadful PC port - Rise of the Ronin joins the list of woeful PC launches with even an Nvidia RTX 4090 succumbing to stutters
An AMD Radeon RX 9070 XT made by Sapphire on a table with its retail packaging
AMD describes its recent RDNA 4 GPU launch as 'unprecedented' and promises restocking the Radeon RX 9070 XT as 'priority number one'
An AMD Radeon RX 9070 XT vs RX 9070 against a red two-tone background
Well, AMD's Radeon RX 9070 series launch isn't going as smoothly as we thought - and it's because retailers have inflated prices
An Nvidia GeForce RTX 5070
Nvidia RTX 5080 stock is so barren that retailers are holding competitions where you can "win" the right to buy one for MSRP
An Nvidia GeForce RTX 4060 Ti
Nvidia could unleash RTX 5060 and 5060 Ti GPUs on PC gamers tomorrow, but there’s no sign of rumored RTX 5050 yet
Latest in News
A super close up image of the Google Gemini app in the Play Store
It's official: Google Assistant will be retired for phones this year, with Gemini taking over
Quordle on a smartphone held in a hand
Quordle hints and answers for Sunday, March 16 (game #1147)
NYT Strands homescreen on a mobile phone screen, on a light blue background
NYT Strands hints and answers for Sunday, March 16 (game #378)
NYT Connections homescreen on a phone, on a purple background
NYT Connections hints and answers for Sunday, March 16 (game #644)
Three iPhone 16 handsets on show
Apple could launch an iPhone 17 Ultra this year – but we've heard these rumors before
Super Mario Odyssey
ChatGPT is the ultimate gaming tool - here's 4 ways you can use AI to help with your next playthrough