Apple Mac Studio M3 Ultra workstation can run Deepseek R1 671B AI model entirely in memory using less than 200W, reviewer finds

Apple Mac Studio
(Image credit: Apple)

  • DeepSeek R1’s 671 billion parameters run smoothly on the M3 Ultra’s unified memory
  • Apple’s Mac Studio proves AI workloads don’t require expensive, power-hungry GPU clusters
  • M3 Ultra consumes under 200W, far less than traditional multi-GPU AI setups

Apple’s Mac Studio with the M3 Ultra chip has demonstrated a capability that no other personal computer can match, running the DeepSeek R1 AI tool with 671 billion parameters entirely in memory.

A test by YouTube reviewer Dave2D showed despite using a 4-bit quantized version of the model, it retained its full parameter count and performed smoothly.

The DeepSeek R1 model, a hefty 404GB of storage and high-bandwidth memory typically found in GPU VRAM, is usually run on multi-GPU setups that distribute processing across several high-end graphics cards.

A unique feat: running DeepSeek R1 in memory

However, the M3 Ultra’s unified memory system, instead of relying on external GPUs, uses its 512GB of unified memory to store and process the AI model in a way that no other personal computer can.

Although MacOS imposes a default VRAM limit, Dave Lee manually increased it through the Terminal to allocate up to 448GB for AI processing, eliminating memory bottlenecks and reducing the need for multiple components to streamline AI performance on a single system.

One of the most striking aspects of this test was the M3 Ultra's power efficiency, as it consumed less than 200W while running DeepSeek R1.

The ability to run such a demanding AI model without a multi-GPU setup challenges the industry standard, which relies on high-end Nvidia and AMD graphics cards, as the best workstations and server farms typically use GPU clusters that consume vast amounts of electricity.

Apple’s unified memory architecture enables significant power savings by sharing the M3 Ultra’s memory pool across CPU and GPU workloads, unlike conventional PC setups where VRAM is separate from system memory, maximizing bandwidth while minimizing energy use.

Apple’s Mac Studio, launched with the M3 Ultra chip, features up to a 32-core CPU and an 80-core GPU, making it one of the best LLM workstations and one of the best video editing computers.

Via Wccftech

You may also like

TOPICS
Efosa Udinmwen
Freelance Journalist

Efosa has been writing about technology for over 7 years, initially driven by curiosity but now fueled by a strong passion for the field. He holds both a Master's and a PhD in sciences, which provided him with a solid foundation in analytical thinking. Efosa developed a keen interest in technology policy, specifically exploring the intersection of privacy, security, and politics. His research delves into how technological advancements influence regulatory frameworks and societal norms, particularly concerning data protection and cybersecurity. Upon joining TechRadar Pro, in addition to privacy and technology policy, he is also focused on B2B security products. Efosa can be contacted at this email: udinmwenefosa@gmail.com

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.

Read more
Mac Studio on a desk
I compared Apple's Mac Studio M3 Ultra with 10 Windows workstations and I am truly shocked by what I found
A mockup of the possible Apple M3 Ultra logo
Performance isn't the only reason you should buy Apple's M3 Ultra Mac Studio - it's reportedly one of the most power-efficient processors too
Mac Studio M4 Max (2025)
Apple levels up the Mac Studio with the M4 Max and unveils its most powerful chip ever, the M3 Ultra
Mac Studio from above.
New benchmark suggests Apple's M3 Ultra may not be much faster than the M4 Max - only a minor uplift in multi-core performance
Mac Studio on a desk
Apple Mac Studio (M3 Ultra): the ultimate creative workstation
Cerebras WSE-3
DeepSeek on steroids: Cerebras embraces controversial Chinese ChatGPT rival and promises 57x faster inference speeds
Latest in Pro
cybersecurity
What's the right type of web hosting for me?
Security padlock and circuit board to protect data
Trust in digital services around the world sees a massive drop as security worries continue
Hacker silhouette working on a laptop with North Korean flag on the background
North Korea unveils new military unit targeting AI attacks
An image of network security icons for a network encircling a digital blue earth.
US government warns agencies to make sure their backups are safe from NAKIVO security issue
Laptop computer displaying logo of WordPress, a free and open-source content management system (CMS)
This top WordPress plugin could be hiding a worrying security flaw, so be on your guard
construction
Building in the digital age: why construction’s future depends on scaling jobsite intelligence
Latest in News
L-mount alliance
Sirui joins L-Mount Alliance to deliver its superb budget lenses for Leica, DJI, Sigma and Panasonic cameras
Security padlock and circuit board to protect data
Trust in digital services around the world sees a massive drop as security worries continue
Samuel and Romy standing very close together in A24's Babygirl movie
Everything new on Max in April 2025, including A24's Babygirl and The Last of Us season 2
An AMD Radeon RX 9070 XT made by Sapphire on a table with its retail packaging
AMD’s secret weapon against Nvidia seems to be stock – way more RX 9070 GPUs are rumored to be hitting shelves than RTX 5000 models
Hacker silhouette working on a laptop with North Korean flag on the background
North Korea unveils new military unit targeting AI attacks
Seth Milchick and Kier Eagan's animatronic speaking in Severance season 2 episode 10
Apple TV+ announces Severance has been renewed for season 3 after that devastating finale