Amazon unveils surprise new video and image AI models to compete with the best on the market

Amazon Nova image generation AI model
(Image credit: Amazon)

  • Amazon unveils new image and video creation AI tools
  • Amazon Nova Canvas and Nova Reel look to help ecommerce sellers
  • Both new Nova models available now on Bedrock

Amazon has announced new image and video generation models as it steps up its fight to become an AI heavyweight.

The company unveiled Amazon Nova Canvas and Nova Reel at its AWS re:Invent 2024 event in Las Vegas, with CEO Andy Jassy revealing the launch as part of a new Nova series of AI models.

Both new models are on Bedrock now, with the launches set to take Amazon into direct competition with the likes of OpenAI and Grok when it comes to image and video creation.

Amazon Nova Canvas and Reel

The new models look to initially target sellers and other users on Amazon's ecommerce platform, allowing them to quickly and cheaply create media content to enrich their pages.

Amazon didn't reveal too much in the way of specifics when it came to the new offerings, but did reveal Nova Canvas will allow users to create and edit images using natural language text inputs, and Nova Reel can provide "studio-quality" video, with features such as camera motion control, 360-degree rotation, and zoom.

In a blog post announcing the news, the company noted that customers on its Amazon Ads platform using the new models advertised five times more products and twice as many images per advertised product, widening their reach to buyers across the globe.

Looking forward, Jassy also revealed Amazon will be launching a Speech-to-Speech generation model in early 2025, followed by an "Any-to-Any" model in mid-2025.

The former will be able to analyse and understand streaming speech input in natural language, with the ability to interpret verbal and nonverbal cues such as tone and cadence, to reply in a natural, human-esque way.

The latter, which Jassy described as a true multimodal to multimodal model, will be able to take in text, images, audio, and video, before outputting in whichever mode is required.

You may also like

TOPICS
Mike Moore
Deputy Editor, TechRadar Pro

Mike Moore is Deputy Editor at TechRadar Pro. He has worked as a B2B and B2C tech journalist for nearly a decade, including at one of the UK's leading national newspapers and fellow Future title ITProPortal, and when he's not keeping track of all the latest enterprise and workplace trends, can most likely be found watching, following or taking part in some kind of sport.

Read more
Canva Magic Studio
What is Canva Magic Studio? Everything we know about the best AI graphic design service
A hand reaching out to touch a futuristic rendering of an AI processor.
“Does generative AI replace people? I strongly don’t believe so” - AWS generative AI VP on the future of work, agents and why Amazon can lead the way
A scientist looking through a microscope generated by Google Veo 2
Google’s new Veo 2 beats OpenAI Sora with 4K AI video generation – here’s how to try it
Sora-generated image
What is OpenAI's Sora? The text-to-video tool explained and how you can use it
Pika 2.0
Pika challenges OpenAI and Sora with new AI video generator features
AI Studios
What is AI Studios by DeepBrain? Everything we know about the AI avatar maker
Latest in Pro
Hands typing on a keyboard surrounded by security icons
Outdated ID verification myths put businesses at risk
Google Chrome dark mode
Google updates Chrome extension rules to ban affiliate link injection without user action or benefit
Abstract image of robots working in an office environment including creating blueprint of robot arm, making a phone call, and typing on a keyboard
This worrying botnet targets unsecure TP-Link routers - thousands of devices already hacked
Windows 10 button on a keyboard
Microsoft’s Remote Desktop app becomes the Windows App
Abstract image of cyber security in action.
Four key questions to strengthen your cyber threat detection strategy
Avast cybersecurity
UK cybersecurity sector could be worth £13bn, research shows
Latest in News
Two Android phones on a green and blue background showing Google Messages
Struggling with slow Google Messages photo transfers? Google says new update will make 'noticeable difference'
Elayne, Egwene, and Nynaeve dressed regally and on horseback in The Wheel of Time season 3
'There's a reason why we do it': The Wheel of Time showrunner responds to fans who are still upset over the Prime Video show's plot alterations
Google Pixel 9
Android 16 could bring an improved Samsung DeX-style desktop mode to more phones
An Nvidia GeForce RTX 4060 Ti
Nvidia could unleash RTX 5060 and 5060 Ti GPUs on PC gamers tomorrow, but there’s no sign of rumored RTX 5050 yet
AI writing
ChatGPT just wrote the most beautiful short story, and I wonder what I'm even doing here
Google Chrome dark mode
Google updates Chrome extension rules to ban affiliate link injection without user action or benefit