Meta wants to fix broken Wikipedia citations for good

The Meta logo on a smartphone in front of the Facebook logo a little bit blurred in the background
(Image credit: Shutterstock / rafapress)

Wikipedia may be the go-to resource on almost everything these days, but according to Meta, it's filled with dodgy, inaccurate citations.

But don't worry: the company says its AI is here to help. It has developed Sphere, a model capable of automatically scanning hundreds of thousands of citations at once to check whether they truly support the corresponding claims.

Meta claims it created a new dataset of 134 million public web pages as a knowledge source for the model, which it says is "an order of magnitude larger and significantly more intricate than ever used for this sort of research".

Meta citations

Sphere uses open web data rather than traditional, proprietary search engines such as Google, and has already compiled 134 million documents from across the web.

Sphere was built using CCNet, a variant of Common Crawl, and Meta says it will help other AI researchers working on knowledge retrieval projects.

Meta says the eventual goal of the project is to build a platform to help Wikipedia editors systematically spot citation issues and quickly fix the citation or correct the content of the corresponding article at scale. 

The company is not partnering with Wikimedia on the project, which is still in the research phase and is not being used to automatically update any content on Wikipedia.

The tool reportedly calls attention to questionable citations, allowing human editors to evaluate the cases most likely to be flawed without having to sift through thousands of properly cited statements. 

If a citation seems irrelevant, Meta says its model will suggest a more applicable source, even pointing to the specific passage that supports the claim.
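To make that workflow concrete, here is a minimal Python sketch of the flag-and-suggest idea. It is not Meta's model: it swaps in a crude word-overlap score for the AI verification the company describes, and every name in it (support_score, flag_citations, suggest_passage, and the 0.5 threshold) is hypothetical and chosen purely for illustration.

```python
import re


def tokens(text):
    """Return the set of lowercase word tokens in a string."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def support_score(claim, passage):
    """Fraction of the claim's words that also appear in the passage (0.0 to 1.0).

    Assumption: word overlap stands in for a real verification model.
    """
    claim_tokens = tokens(claim)
    if not claim_tokens:
        return 0.0
    return len(claim_tokens & tokens(passage)) / len(claim_tokens)


def flag_citations(citations, threshold=0.5):
    """Return citations scoring below the threshold, weakest support first."""
    scored = [(support_score(c["claim"], c["passage"]), c) for c in citations]
    flagged = [pair for pair in scored if pair[0] < threshold]
    flagged.sort(key=lambda pair: pair[0])
    return [citation for _, citation in flagged]


def suggest_passage(claim, corpus):
    """Pick the corpus passage that overlaps most with the claim."""
    return max(corpus, key=lambda passage: support_score(claim, passage))


if __name__ == "__main__":
    # Made-up example data, not taken from Wikipedia or from Meta's dataset.
    citations = [
        {"claim": "The Eiffel Tower is in Paris",
         "passage": "The Eiffel Tower is a landmark in Paris, France."},
        {"claim": "Mount Everest is the highest mountain on Earth",
         "passage": "Bananas are rich in potassium."},
    ]
    corpus = [
        "Bananas are rich in potassium.",
        "Mount Everest is the highest mountain on Earth above sea level.",
        "Paris is the capital of France.",
    ]
    for citation in flag_citations(citations):
        print("Questionable:", citation["claim"])
        print("Suggested source:", suggest_passage(citation["claim"], corpus))
```

In this toy version, only citations that score below the threshold reach the human editor, which mirrors the stated aim of sparing editors from sifting through properly cited statements.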

You can grab the source code for the project on GitHub here, and interested parties can also read a full write-up of the project's findings here or access the demo here.

Will McCurdy has been writing about technology for over five years. He has a wide range of specialities including cybersecurity, fintech, cryptocurrencies, blockchain, cloud computing, payments, artificial intelligence, retail technology, and venture capital investment. He has previously written for AltFi, FStech, Retail Systems, and National Technology News and is an experienced podcast and webinar host, as well as an avid long-form feature writer.
