AWS offers more flexible access to Nvidia GPUs for short-duration AI workloads
Running AI and ML workloads got a whole lot cheaper, for some
AWS, an already popular cloud computing service for developers looking to access the best-performing hardware for AI workloads, has announced a more flexible scheme for shorter-term requirements.
Amazon Elastic Compute Cloud (EC2) Capacity Blocks for ML is what Amazon is calling an industry-first, and will allow customers to access GPUs on a consumption-based model.
The Seattle-based cloud giant hopes that more affordable options will provide smaller organizations with greater opportunities, helping to make for a more diverse landscape.
AWS launches short-term consumption-based GPU renting
In a press release, the company said: “With EC2 Capacity Blocks, customers can reserve hundreds of Nvidia GPUs colocated in Amazon EC2 UltraClusters designed for high-performance ML workloads.”
Customers can get access to the latest Nvidia H100 Tensor Core GPUs, which are suited to training foundation models and large language models, by specifying cluster size and duration, meaning they only pay for what they need.
Amazon noted that demand for GPUs is fast outpacing supply as more businesses get to grips with generative AI, and many will either find themselves paying for an excessive service or having GPUs sitting dormant when they’re not in use – or worse still, both.
AWS users can reserve EC2 UltraClusters of P5 instances for between 1-14 days, and up to eight weeks in advance. They can pick flexible cluster size options, ranging from 1-64 instances, or a maximum of 512 GPUs.
Are you a pro? Subscribe to our newsletter
Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!
AWS Compute and Networking VP David Brown commented: “With Amazon EC2 Capacity Blocks, we are adding a new way for enterprises and startups to predictably acquire Nvidia GPU capacity to build, train, and deploy their generative AI applications – without making long-term capital commitments. It’s one of the latest ways AWS is innovating to broaden access to generative AI capabilities.”
Pricing for the service can be found on the AWS website, where prospective users can also sign up to use the short-term, affordable option.
More from TechRadar Pro
- Want to use something that’s already been built and tested? Here are the best AI tools
- AWS upgrades its Apple Mac stack for extra cloud power
- We’ve also rounded up the best cloud hosting providers
With several years’ experience freelancing in tech and automotive circles, Craig’s specific interests lie in technology that is designed to better our lives, including AI and ML, productivity aids, and smart fitness. He is also passionate about cars and the decarbonisation of personal transportation. As an avid bargain-hunter, you can be sure that any deal Craig finds is top value!