Reddit restricts Internet Archive due to worries about AI data collection

As artificial intelligence models become more prevalent, demand for large volumes of training data is surging. In response, some companies are tightening their policies on AI data scraping, particularly when the scrapers are not paying for access.

Reddit has announced plans to restrict the Internet Archive's Wayback Machine from accessing its platform, citing claims that AI firms are using the archive to obtain Reddit data without permission. Under the restriction, archiving will be largely limited to Reddit's homepage.

“Until we can ensure the protection of our sites and adherence to platform policies, we’re limiting access to some Reddit data for the sake of our users,” said a Reddit spokesperson.

These restrictions were reported to commence on a “ramp-up” basis starting Monday, but Reddit has not disclosed which AI companies are involved in this data scraping.

Reddit stated that while the Internet Archive provides a service that benefits the open web, it has observed AI companies violating its platform rules by scraping Reddit data from the Wayback Machine.

Some Reddit users feel this move runs contrary to the vision of co-founder Aaron Swartz, who dedicated his life to making online content freely accessible. He took his own life shortly before he was due to stand trial for allegedly accessing and downloading millions of academic journal articles.

“Aaron is probably rolling in his grave,” a user commented.

Tim Rathschmidt from Reddit emphasized that the changes are intended to safeguard users. “We’re taking these steps to protect Redditors until we can guarantee compliance with our policies, including respect for user privacy and the removal of deleted content.”

However, some speculate that the decision is financially motivated, given that Reddit has previously taken legal action against AI companies over the use of its data without payment. Reddit has formed a partnership with OpenAI, while taking legal action against other firms that it says did not meet its requirements.

Mark Graham, director of the Wayback Machine, acknowledged a longstanding engagement with Reddit and mentioned ongoing discussions about these challenges.
