
2024-07-24 19:29:49| Engadget

When Reddit said last month that it would block unauthorized data scraping from its site, everyone's (rightful) first reaction was "AI, AI, AI." However, now that the change has taken effect, chatbot makers aren't the only ones being locked out. The widely used forum also appears to be blocking all search engines other than Google, which reportedly inked a deal earlier this year with Reddit worth $60 million annually.
404 Media reported on Wednesday (and Engadget confirmed in our queries) that searching for Reddit results from the past week on rival engine Bing (using "site:reddit.com") returns empty results. The publication reported that DuckDuckGo produced seven links without any descriptions, only providing the note, "We would like to show you a description here but the site won't allow us." The engine now appears to have removed even those, as our test only produced an empty page reading, "no results found."
Reddit said last month that it would update its Robots Exclusion Protocol file (robots.txt) to block automated data scraping, and it's now apparent that the change wasn't only meant to thwart AI companies like Perplexity and its controversial "answer engine." Currently, Google appears to be the only search engine allowed to crawl Reddit and produce results from "the front page of the internet."
Ironically, part of the forum website's robots.txt file reads, "Reddit believes in an open internet, but not the misuse of public content." The file for Reddit now essentially says, "Do not scrape." Apparently, it now considers search engines that don't buy into exclusive deals to be misusing its content.
The ubiquitous robots.txt is the web standard that communicates which parts of a site can be crawled. Although many crawlers are known to ignore its instructions, Google's standard procedure is to respect it. So, on the technical side, the companies in cahoots on the lucrative deal appear to have deployed some manual override.
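To make the robots.txt mechanics concrete, here is a minimal sketch of how a well-behaved crawler might interpret such a file, using Python's standard `urllib.robotparser`. The rule sets, the bot names `ExampleBot` and `FriendlyBot`, and the sample URL are all assumptions made for illustration; they are not the contents of Reddit's actual file, and the second rule set only shows what the standard permits, not how any exception for Google is implemented.

```python
from urllib.robotparser import RobotFileParser


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request is made)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


# Hypothetical "do not scrape" file: every crawler is denied every path.
BLOCK_ALL = """\
User-agent: *
Disallow: /
"""

# Hypothetical variant: one named crawler is allowed, everyone else is denied.
PER_AGENT = """\
User-agent: FriendlyBot
Allow: /

User-agent: *
Disallow: /
"""

URL = "https://www.reddit.com/r/python/"  # sample URL, chosen for illustration

block_all = build_parser(BLOCK_ALL)
per_agent = build_parser(PER_AGENT)

# A compliant crawler checks before fetching and skips the URL if denied.
print("block-all, ExampleBot: ", block_all.can_fetch("ExampleBot", URL))
print("per-agent, FriendlyBot:", per_agent.can_fetch("FriendlyBot", URL))
print("per-agent, ExampleBot: ", per_agent.can_fetch("ExampleBot", URL))
```

Note that the file is purely advisory: nothing in this check stops a crawler that never calls it, which is why sites that want to enforce a block have to do so by other means.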
Of course, the saga is a trickle-down effect of AI chatbots scraping the live web for results. With courts slow to determine how much of the open web is fair use to train chatbots on, companies like Reddit, whose bottom lines now depend on safeguarding their data from those who don't pay, are building walls at the expense of the open web. (Although, given the integral role Microsoft has played in this AI era, cozying up with OpenAI early on, it seems ironic that Bing finds itself on the losing end of at least one aspect of the fallout.)
Colin Hayhurst, CEO of lesser-known no-tracking search engine Mojeek, told 404 Media that Reddit is "killing everything for search but Google." In addition, the executive said his attempts to contact Reddit were ignored. "It's never happened to us before," he said. "Because this happens to us, we get blocked, usually because of ignorance or stupidity or whatever, and when we contact the site you certainly can get that resolved, but we've never had no reply from anybody before."
Engadget asked Google and Reddit for comment and confirmation, but we hadn't heard back by publication. 404 Media reported running into a similar wall of silence from the companies.
Reddit has made no secret of its desire to block AI companies from scraping its treasure trove of data in this burgeoning age of AI. Last year, CEO Steve Huffman risked alienating large portions of Reddit's user base by blocking third-party API requests, leading to the demise of beloved apps like Christian Selig's Apollo. Despite widespread protests among moderators and forum-goers, the company only temporarily lost negligible numbers of users. The gamble appeared to pay off, and Reddit recovered. It went public in March.
This article originally appeared on Engadget at https://www.engadget.com/search-engines-that-dont-pay-up-cant-index-reddit-content-172949170.html?src=rss


Category: Marketing and Advertising

 
