AMD lands yet another major cloud deal as Oracle adopts thousands of Instinct MI300X GPUs to power new AI supercluster

AMD Instinct MI300X GPU
(Image credit: AMD)

AMD’s Instinct MI300X is an incredibly powerful AI accelerator, and major cloud companies are beginning to integrate it into their infrastructure to support intensive AI workloads.

Vultr recently announced that it had ordered “thousands” of MI300X units, and now Oracle Cloud Infrastructure (OCI) says it has adopted AMD’s hardware for its new OCI Compute Supercluster instance, BM.GPU.MI300X.8.

The new supercluster is designed for massive AI models containing billions of parameters and supports up to 16,384 GPUs in a single cluster. This setup leverages the same high-speed technology used by other OCI accelerators, enabling large-scale AI training and inference with the memory capacity and throughput required for the most demanding tasks. The configuration makes it particularly suited for LLMs and complex deep learning operations.

Preproduction testing

“AMD Instinct MI300X and ROCm open software continue to gain momentum as trusted solutions for powering the most critical OCI AI workloads,” said Andrew Dieckmann, corporate vice president and general manager, Data Center GPU Business, AMD. “As these solutions expand further into growing AI-intensive markets, the combination will benefit OCI customers with high performance, efficiency, and greater system design flexibility.”

Oracle says its testing of the MI300X as part of its preproduction efforts validated the GPU’s performance in real-world scenarios. For the Llama 2 70B model, the MI300X achieved a 65 millisecond "time to first token" latency and scaled efficiently to generate 3,643 tokens across 256 concurrent user requests. In another test with 2,048 input and 128 output tokens, it delivered an end-to-end latency of 1.6 seconds, matching closely with AMD’s own benchmarks.

The OCI BM.GPU.MI300X.8 instance features 8 AMD Instinct MI300X accelerators, delivering 1.5TB of HBM3 GPU memory with a bandwidth of 5.3TB/s, paired with 2TB of system memory and 8 x 3.84TB NVMe storage. Oracle will be offering the bare-metal solution for $6 per GPU/hour.

“The inference capabilities of AMD Instinct MI300X accelerators add to OCI’s extensive selection of high-performance bare metal instances to remove the overhead of virtualized compute commonly used for AI infrastructure,” said Donald Lu, senior vice president of software development at Oracle Cloud Infrastructure. “We are excited to offer more choice for customers seeking to accelerate AI workloads at a competitive price point.”

More from TechRadar Pro

TOPICS
Wayne Williams
Editor

Wayne Williams is a freelancer writing news for TechRadar Pro. He has been writing about computers, technology, and the web for 30 years. In that time he wrote for most of the UK’s PC magazines, and launched, edited and published a number of them too.

Read more
AMD Instinct MI325X AI accelerator
Yay, you can now use AMD's fastest ever GPU - AMD's Instinct MI325X AI accelerator has 256GB memory and can run Crysis (sort of)
AMD instinct
AMD fast-tracks its most powerful AI GPU ever as it seeks to steal market sharefrom Nvidia's Blackwell B100 and B200
AMD logo
Mysterious die set to feature in AMD's Instinct MI400, its next blockbuster APU which could power El Capitan's successor
Comino Grando H100 Server
AMD-powered, liquid cooled Comino Grando AI sever gets reviewed but I still can't see any octo Nvidia RTX 5090 GPUs configuration
HPE
HPE may have beaten Supermicro and Dell to win a $1bn AI contract, but it's not for the Colossus supercomputer
Oracle
Oracle signs up Meta for AI training deal
Latest in Pro
A graphic showing fleet tracking locations over a city.
Lost & Found tracking site hit by major data breach - over 800,000 could be affected
US President Donald Trump speaks to the press as he signs an executive order to create a US sovereign wealth fund, in the Oval Office of the White House on February 3, 2025, in Washington, DC.
US set to pause cyber-offensive operations against Russia - but CISA says it won't stop
Web DDoS attacks see major surge as AI allows more powerful attacks
Polish space agency says it was hit by a cyberattack
Illustration of a hooked email hovering over a mobile phone
AWS misconfigurations reportedly used to launch phishing attacks
Hands typing on a keyboard surrounded by security icons
Your passwords aren't the key to protecting your online identity, your email address is
Latest in News
Google Pixel 9 Pro
Here are the 7 best Pixel 9 and Pixel Watch 3 features landing in March’s Pixel Feature Drop
Bang & Olufsen Beogram 4000C Saint Laurent Rive Droite Edition
Bang & Olufsen's latest reworked turntable is a masterpiece of retro revival, in a breathtaking wooden presentation box
Apple Watch Series 10
Apple unveils new Apple Watch bands – here's what's in the Spring 2025 collection
iPad Air M3
Apple makes one hardware change to the iPad Air that might be the best indicator of its true lightweight tablet intentions
Shure MoveMic 88+ lifestyle image
Shure's tiny MoveMic 88+ gives creators a cheap and easy way to record crystal clear audio on a smartphone
An operator fires a saw blade from a weapon
Call of Duty: Black Ops 6 Season 3 gets two-week delay, will now release in April