Mining internet history unearths hidden gems

It may seem that the internet is all about the here and now, but a new project from Adobe and a US university shows that there's gold to be mined in the accumulated layers of websites as they age.

Researchers at Adobe's Advanced Technologies Lab and the University of Washington have come up with a project they call Zoetrope that makes the history of the Web useful in a way sites like the Internet Archive can never be.

Crawling archives

The software stores the content of selected websites every hour and uses that to create searchable views of how they change over time.

Users can simply scroll through versions of a site – a news page, for example – to see what happened and how it developed or can focus on specific parts of a page.

Possibilities include tracking product prices to establish trends or even comparing specific variables on different sites.
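To make the idea concrete, here is a minimal sketch of an hourly snapshot archive that can be "scrolled" by time and mined for one changing value, such as a price. It is not Zoetrope's actual code: the `SnapshotStore` class, its methods, the `shop.example` URL and the sample prices are all invented for illustration.

```python
import re
from bisect import bisect_right
from dataclasses import dataclass, field
from datetime import datetime, timedelta


@dataclass
class SnapshotStore:
    """Hypothetical in-memory archive: a list of (time, html) captures per URL."""
    pages: dict = field(default_factory=dict)

    def add(self, url, when, html):
        # Keep each URL's captures sorted by capture time.
        snaps = self.pages.setdefault(url, [])
        snaps.append((when, html))
        snaps.sort(key=lambda s: s[0])

    def at(self, url, when):
        # Return the latest capture taken at or before `when`, or None.
        snaps = self.pages.get(url, [])
        times = [t for t, _ in snaps]
        i = bisect_right(times, when)
        return snaps[i - 1][1] if i else None

    def track(self, url, pattern):
        # Pull one number out of every capture; skip captures without a match.
        series = []
        for when, html in self.pages.get(url, []):
            m = re.search(pattern, html)
            if m:
                series.append((when, float(m.group(1))))
        return series


# Invented sample data: three hourly captures of a made-up product page.
URL = "https://shop.example/widget"
start = datetime(2008, 1, 1, 0, 0)
store = SnapshotStore()
for hour, price in enumerate([19.99, 19.99, 17.49]):
    store.add(URL, start + timedelta(hours=hour),
              f"<span class='price'>${price}</span>")

# The price as a time series, ready to chart or compare.
prices = store.track(URL, r"\$(\d+\.\d+)")

# What the page looked like 90 minutes after the first capture.
page_then = store.at(URL, start + timedelta(minutes=90))
```

Here `track` does the trend-spotting and `at` does the scrolling. A real system would also need a crawler, persistent storage and a way to select part of a page that is sturdier than a regular expression.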

Simpler processes

Zoetrope might easily show correlations between the number of goals scored in football matches and the amount of money spent on players' salaries, for example.
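As a rough illustration of what such a comparison involves, the sketch below computes a Pearson correlation between two series that might have been pulled from different sites. The `pearson` helper and the numbers are assumptions made for this example: the figures are placeholders chosen to be perfectly linear, not real salary or goal data, and nothing here reflects how Zoetrope itself does the maths.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric series."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two series of equal length, at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (the unnormalised covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Root of the summed squared deviations of each series.
    dev_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    dev_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if dev_x == 0 or dev_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / (dev_x * dev_y)


# Placeholder values only: one entry per club, invented for the example.
salary_spend = [10.0, 20.0, 30.0, 40.0]
goals_scored = [25.0, 45.0, 65.0, 85.0]

r = pearson(salary_spend, goals_scored)
```

A result near 1 or -1 suggests the two series move together, while a value near 0 suggests they do not. Either way, a correlation on its own says nothing about cause.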

While such comparisons could be done by hand, the point of the project is to make it simple and intuitive to do so using much of what we already have scattered across different sites.

Indexing everything?

Zoetrope currently has a four-month database of 1,000 popular websites as its starting point. Researcher Eytan Adar explains: "It's impossible to crawl and capture some of these things at the rate at which they're changing.

"But for something like Zoetrope, it's a smaller percentage of the Web that we want to track. We don't actually need to get every single page that's out there."

Nevertheless, if sites like the Internet Archive come good on plans to share their data with Zoetrope, we could soon be looking at the internet in a very different way.

J Mark Lytle was an International Editor for TechRadar, based out of Tokyo, who now works as a Script Editor, Consultant at NHK, the Japan Broadcasting Corporation. Writer, multi-platform journalist, all-round editorial and PR consultant with many years' experience as a professional writer, their bylines include CNN, Snap Media and IDG.
