Publishers are blocking the Internet Archive for fear AI scrapers can use it as a workaround

January 29, 2026

The Internet Archive has long been a valuable resource for journalists, whether for finding records of deleted tweets or for providing academic texts for background research. However, the advent of AI has created a new tension between publishers and the archive. A few major publications have begun blocking the nonprofit digital library’s access to their content over concerns that AI companies’ bots are using the Internet Archive’s collections to indirectly scrape their articles.

“A lot of these AI businesses are looking for readily available, structured databases of content,” Robert Hahn, head of business affairs and licensing for The Guardian, told Nieman Lab. “The Internet Archive’s API would have been an obvious place

→ Continue reading at Engadget
