SELECT LANGUAGE BELOW

Reddit gives Google access to content for training AI models

Reddit has agreed to give Google access to its content to train the tech giant’s artificial intelligence (AI) models, the social message board site announced Thursday.

The deal is reportedly worth $60 million and will give Google access to Reddit’s data application programming interface (API).

“The Reddit Data API allows Google to efficiently and structuredly access newer information, help us better understand content on Reddit, and help us display, train, and otherwise find the most accurate and relevant content.” “You’ll have access to enhanced signals that will help you use your search in a better way,” Rajan Patel, Google’s vice president of search experience, said in a news release.

The deal also aims to “make it easier for people to find and access the communities and conversations they’re looking for on Reddit” by developing new ways to display content across Google products, Reddit said. I mentioned it in a blog post.

“This enhanced collaboration will give Google efficient and structured access to the vast corpus of existing content on Reddit, allowing them to use the Reddit Data API to improve their products and services. This includes supporting new ways to display Reddit content and more efficient ways to “train models,” the platform added.

However, Reddit stressed that the deal does not change the terms of its API policy, which was updated last year to prevent companies from using APIs for commercial purposes without prior approval.

Last fall, the site reportedly considered blocking search crawlers from Google and Bing if they could not reach an agreement to pay data access fees. washington post.

The platform, which also filed to go public on Thursday, said in its IPO prospectus that it signed data licensing deals worth a total of $203 million in January.

“We are also in the early stages of monetizing new opportunities in data licensing by allowing third parties to access, search and analyze data on our platform.”

“Data on Reddit is constantly growing and regenerating as users interact with the community and with each other,” it added. “We believe our growing platform data will become a key element in training leading large-scale language models (“LLMs”) and serve as an additional monetization channel for Reddit. ”

Copyright 2024 Nexstar Media Inc. All rights reserved. This material may not be published, broadcast, rewritten, or redistributed.

Facebook
Twitter
LinkedIn
Reddit
Telegram
WhatsApp

Related News