Character.AI institutes new safety measures for AI chatbot conversations

Character.AI has rolled out new safety features and policies for building and interacting with the AI-powered virtual personalities it hosts. The new measures aim to make the platform safer for all users, particularly younger ones. The update includes more control over how minors engage with the AI chatbots, more content moderation, and better detection of when the AI discusses topics like self-harm.

Though the blog post about the update does not mention it, Character.AI linked to the announcement in a post on X expressing condolences to the family of a 14-year-old who spent months interacting with one of Character.AI's chatbots before taking his own life. His family has now filed a wrongful death lawsuit against Character.AI, citing a lack of safeguards for the AI chatbots as a contributor to his suicide.

AI chat guardrails

Character.AI's post laid out several new safety features for the platform. For instance, if the model detects keywords related to suicide or self-harm, it will display a pop-up directing the user to the National Suicide Prevention Lifeline and related resources. The AI will also be better at spotting and removing inappropriate content in a conversation, with particular sensitivity when users are under 18.

Presumably, conversations with minors already had content restrictions, but Character.AI may have tightened them further. In cases where that might not be enough, entire chatbots have been removed.

"We conduct proactive detection and moderation of user-created Characters, including using industry-standard and custom blocklists that are regularly updated. We proactively, and in response to user reports, remove Characters that violate our Terms of Service," Character.AI explained in its post. "Users may notice that we’ve recently removed a group of Characters that have been flagged as violative, and these will be added to our custom blocklists moving forward."

Other new features are more about helping ground users. After you have spent an hour on the platform, you'll see a notification asking if you want to keep going, which is meant to help you avoid losing track of time. You'll also see more prominent disclaimers emphasizing that the AI is not a real person. Such disclaimers already appear in conversations, but Character.AI wants to make them impossible to ignore.

These safety features are the flipside of how Character.AI has made engaging with chatbots feel more like talking to a real person, including voices and the two-way voice conversations available with the Character Calls feature. Still, the company is likely keen to ensure its services are as safe as possible, and its moves could inform how others in the space shape their own AI chatbot characters.

Eric Hal Schwartz
Contributor

Eric Hal Schwartz is a freelance writer for TechRadar with more than 15 years of experience covering the intersection of the world and technology. For the last five years, he served as head writer for Voicebot.ai and was on the leading edge of reporting on generative AI and large language models. He's since become an expert on the products of generative AI models, such as OpenAI’s ChatGPT, Anthropic’s Claude, Google Gemini, and every other synthetic media tool. His experience runs the gamut of media, including print, digital, broadcast, and live events. Now, he's continuing to tell the stories people want and need to hear about the rapidly evolving AI space and its impact on their lives. Eric is based in New York City.
