Reddit is cracking down on AI bots




In May, Reddit announced it would allow OpenAI to train its models on Reddit content for a price. Now, according to The Verge, Reddit will block most automated bots from accessing, learning from, and profiting from its data without a similar licensing agreement.

Reddit plans to do this by updating its robots.txt file, the “basic social contract of the web” that tells web crawlers which parts of a site they may access. Most nascent AI companies (including, at one point, OpenAI) train their models on content they’ve scraped from across the web without considering copyright or the Terms of Service of individual sites.
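
For readers unfamiliar with how a robots.txt file is consulted, here is a minimal sketch using Python's standard-library `urllib.robotparser`. The rules, bot names, and URL below are hypothetical and are not Reddit's actual file; they only illustrate a policy that admits one named crawler and turns away everyone else. A well-behaved crawler runs a check like this before fetching a page. Nothing forces it to.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration only; this is NOT Reddit's real robots.txt.
# One named crawler is allowed everywhere, every other user agent is disallowed.
EXAMPLE_RULES = """\
User-agent: ExampleSearchBot
Allow: /

User-agent: *
Disallow: /
"""


def may_fetch(user_agent: str, url: str, rules: str = EXAMPLE_RULES) -> bool:
    """Return True if the given rules permit user_agent to crawl url."""
    parser = RobotFileParser()
    # parse() accepts the file's lines, so no network request is made here.
    parser.parse(rules.splitlines())
    return parser.can_fetch(user_agent, url)


# Both bot names and the URL are made up for this example.
print(may_fetch("ExampleSearchBot", "https://example.com/r/python"))  # True
print(may_fetch("UnlicensedAIBot", "https://example.com/r/python"))   # False
```

The check is purely advisory: the file states the site's wishes, and it is up to each crawler to read and honor them.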

Per The Verge’s Alex Heath, search engines like Google got away with this form of scraping thanks to the “give-and-take” of Google sending traffic back to individual sites in exchange for the ability to crawl them for information. Now, AI companies are tipping the balance by taking that same information and providing it to users without sending them back to the sites the information came from.

Reddit’s chief legal officer, Ben Lee, told The Verge that the parameters of robots.txt are not legally enforceable but that publicizing Reddit’s intention to enforce its content policy is “a signal to those who don’t have an agreement with us that they shouldn’t be accessing Reddit data.”

In a blog post about the change, Reddit noted that “good faith actors – like researchers and organizations… will continue to have access to Reddit content for non-commercial use.” These include the Internet Archive, home to the Wayback Machine.

