This tiny UK startup chipmaker is targeting Intel and Nvidia with a monster AI CPU

(Image credit: Graphcore)

Graphcore, a U.K.-based AI computing startup, has unveiled its new Colossus MK2 GC200 intelligence processing unit (IPU) featuring 59.4 billion transistors, making it currently the most complex chip in the world. 

The IPU was designed specifically for machine intelligence tasks and can scale out to offer 1 PetaFLOPS of FP16.16 compute horsepower in a 1U machine or up to 16 ExaFLOPS in a datacenter.

The most complex

The Graphcore Colossus MK2 GC200 IPU packs 1,472 independent IPU cores with SMT that can handle 8,832 separate parallel threads. Each core is equipped with its own memory and therefore the chip carries 900 MB of SRAM with an aggregated bandwidth of 47.5 TB/s per chip. Graphcore’s GC200 IPUs also features 10 IPU links to connect to other GC200 chips at up to 320 GB/s speed as well as a PCIe 4.0 x16 interface. 

(Image credit: Graphcore)

Graphcore’s Colossus MK2 GC200 chip us made using TSMC’s 7 nm process technology and contains 59.4 billion transistors, more than Nvidia’s A100 GPU that has 54 billion transistors.

Up to 16 ExaFLOPS

Each Colossus MK2 GC200 IPU can provide 250 TeraFlops of AI compute performance at FP16.16 and FP16.SR (stochastic rounding) precision as well as 62.5 TeraFLOPS of single-precision FP32 performance. Four of GC200 chips inside Graphcore’s IPU-M2000 system (which comes with 448 GB of exchange DRAM to handle large workloads) offer up to 1 PetaFLOPS of FP16.16/FP16.SR. 

(Image credit: Graphcore)

“Our Colossus IPUs are unique in having support for Stochastic Rounding on the arithmetic that is supported in hardware and runs at the full speed of the processor,” said Nigel Toon, CEO of Graphcore. “This allows the Colossus Mk2 IPU to keep all arithmetic in 16-bit formats, reducing memory requirements, saving on read and write energy and reducing energy in the arithmetic logic, while delivering full accuracy Machine Intelligence results.” 

Customers who need more performance can also order IPU-POD64 systems powered by 16 IPU-M2000 machines (and therefore providing 16 PetaFLOPS), whereas large organizations can scale out to 64,000 IPUs for 16 ExaFLOPS at FP16.16/FP16.SR. 

(Image credit: Graphcore)

Since IPUs cannot run operating systems, Grapcore’s IPU-M2000 box(es) have to be connected to a regular CPU-based node(s). Meanwhile, users are not confined to a fixed ratio of CPU to Machine Intelligence compute at a server level and can adjust their hardware mix. 

To take advantage of Graphcore’s Colossus IPUs, one needs to use the company’s proprietary Polar software tailor-made for the company’s architecture with particular silicon in mind. Graphcore says that it has put a lot of effort into optimizing its software stack in a bid to maximize performance one can extract from its hardware and make the whole solution easy to use. 

Available in Q4

Graphcore’s IPU-M2000 and IPU-POD64 systems are available to pre-order now with production volume shipments committing in the fourth quarter. Interested parties can get access and evaluate IPU-POD systems in the cloud offered by Cirrascale. 

At present, select customers are already evaluating Graphcore’s Colossus MK2 platform. 

“J.P. Morgan is evaluating Graphcore’s technology to see if our solutions can accelerate their advances in AI, specifically, in the NLP and speech recognition arenas,” a statement by the company reads.

Via The Verge

Anton Shilov is the News Editor at AnandTech, Inc. For more than four years, he has been writing for magazines and websites such as AnandTech, TechRadar, Tom's Guide, Kit Guru, EE Times, Tech & Learning, EE Times Asia, Design & Reuse.

Latest in Pro
UK Prime Minister Sir Kier Starmer
UK PM says AI should soon replace civil servants
Image depicting hands typing on a keyboard, with phishing hooks holding files, passwords and credit cards.
Microsoft warns about a new phishing campaign impersonating Booking.com
An image of network security icons for a network encircling a digital blue earth.
Why effective cybersecurity is a team effort
Data leak
Hacked Tata Technologies data leaked by ransomware gang
A close-up photo of an iPhone, with the App Store icon prominent in the center of the image.
Thousands of iOS apps found to expose user data and leak Stripe keys
Intel CEO Lip-Bu Tan
Intel reveals its new CEO
Latest in News
The Russo brothers posing for a photograph and Herman carrying a Volkswagen camper van in The Electric State
'We're optimists': AI enthusiasts Joe and Anthony Russo defend its use in movies and TV shows, but admit there are 'very real dangers' around its application
UK Prime Minister Sir Kier Starmer
UK PM says AI should soon replace civil servants
Xbox Copilot in Minecraft
Microsoft confirms Copilot can be tested by Xbox Insiders next month and shares new details about how the AI sidekick will enhance the player experience: 'It has to be about gameplay, it has to be personalized to you'
Eight Samsung TVs mounted to the wall showing different basketball games
Samsung is offering you 8 new TVs in one bundle for March Madness, in case you want to watch all games at once like a Bond villain’s lair
Image depicting hands typing on a keyboard, with phishing hooks holding files, passwords and credit cards.
Microsoft warns about a new phishing campaign impersonating Booking.com
The Steam Logo on a mobile phone in front of a wall of games.
Today’s Steam Spring Sale features my absolute favorite game of all time - here's when the sale starts and all the key info