185 comments
    • Around here we love the idea of Reddit being totally devoid of life, but the fact is it's still one of the most active public-facing sites on the web. The attrition to sites like Lemmy is pretty negligible relative to overall Reddit activity, and bot/AI activity only really affects the largest subreddits, which have always been a bit spammy and clickbaity. The medium and small subreddits are still full of active people. Don't get me wrong, Lemmy is my daily driver for this content, but I won't pretend everyone fled Reddit for this.

      Additionally, exclusivity with Google isn't just about keeping the search results; it's also about preventing Google's biggest AI competition, ChatGPT with its ties to Microsoft, from getting access to what is the Internet's largest database of public-facing conversation.

    • I wonder what kind of contract they went with.

      https://www.reuters.com/technology/reddit-ai-content-licensing-deal-with-google-sources-say-2024-02-22/

      SAN FRANCISCO, Feb 21 (Reuters) - Social media platform Reddit has struck a deal with Google (GOOGL.O) to make its content available for training the search engine giant's artificial intelligence models, three people familiar with the matter said.

      The contract with Alphabet-owned Google is worth about $60 million per year, according to one of the sources.

      For perspective:

      https://www.cbsnews.com/news/google-reddit-60-million-deal-ai-training/

      In documents filed with the Securities and Exchange Commission, Reddit said it reported net income of $18.5 million — its first profit in two years — in the October-December quarter on revenue of $249.8 million.

      So if you annualize that, Reddit's seeing revenue of about $1 billion/year, and net income of about $74 million/year.
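
      As a quick check on that arithmetic, here is a minimal sketch that assumes every quarter looks like the quoted October-December one (a naive run-rate, not a forecast). The function and variable names are made up for illustration.

```python
# Quarterly figures quoted from the CBS article above, in millions of USD.
quarterly_revenue_musd = 249.8
quarterly_net_income_musd = 18.5


def annualize(quarterly_musd: float, quarters_per_year: int = 4) -> float:
    """Naive run-rate: assume all quarters match the one given."""
    return quarterly_musd * quarters_per_year


annual_revenue_musd = annualize(quarterly_revenue_musd)
annual_net_income_musd = annualize(quarterly_net_income_musd)

print(f"revenue:    ~${annual_revenue_musd:.1f}M/year")
print(f"net income: ~${annual_net_income_musd:.1f}M/year")
```

      That comes out to just under $1 billion in revenue and $74 million in net income, ignoring any seasonality between quarters.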

      Given that Reddit granting exclusive indexing to Google happened at about the same time, I would assume that that AI-training deal included the exclusivity indexing agreement, but maybe it's separate.

      My gut feeling is that the exclusivity thing is probably worth more than $60 million/year, and that Google's probably getting a pretty good deal. Like, Google did not buy Reddit, and Google's done some pretty big acquisitions, like YouTube, and that'd have been another way for Google to get exclusive access. So I'd think that this deal is probably better for Google than buying Reddit. Reddit's market capitalization is $10 billion, so Google is maybe paying 0.6% of the value of Reddit per year to have exclusive training rights to their content and to be the only search engine indexing them; aside from Reddit users themselves running into content in subreddits, I'd guess that those two forms are probably the main ways in which one might leverage the content there.
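
      The 0.6% figure is just the reported annual deal value divided by the market cap. A minimal sketch, taking the $60 million/year from the Reuters piece and the $10 billion market cap as stated in this comment (both rough, and the names are hypothetical):

```python
# Inputs in millions of USD; both are the rough figures quoted in this thread.
deal_musd_per_year = 60.0
market_cap_musd = 10_000.0  # $10 billion, as stated in the comment


def annual_share_pct(fee_musd: float, value_musd: float) -> float:
    """Annual fee expressed as a percentage of total company value."""
    return fee_musd / value_musd * 100.0


share_pct = annual_share_pct(deal_musd_per_year, market_cap_musd)

print(f"annual fee as share of market cap: {share_pct:.1f}%")
```

      Put the other way around, at that rate the fees would take well over a century to add up to the market cap, which is the sense in which licensing looks cheap next to an acquisition.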

      Plus, my impression is that the idea that a number of companies have -- which may or may not be valid -- is that this is the beginning of the move away from search engines. Like, the idea is that down the line, the typical person doesn't use a search engine to find a webpage that's a primary source for the material. Instead, they just query an AI, which compiles all the data that it can see and spits out an answer. That saves the human searcher some time and reduces complexity, and maybe can solve some problems if AIs can ultimately do a better job of filtering out erroneous information than humans. We definitely aren't there yet in 2024, but if that's where things are going, I think that it might make a lot of strategic sense for Google. If Google can lock up major sources of training data and keep Microsoft out, then it's gonna put Microsoft in a difficult spot if Microsoft is gunning for the same thing.

    • At least on some smaller subs, there seems to be a suspicious number of brand-new accounts asking one question to get human answers.
      It would not surprise me if Reddit, or some other service, is seeding to get more LLM-able content. Of course, this might backfire if people start giving stupid answers to eff up the data.

  • Makes sense they've spent years curating other people's content and are now selling it..... Oh wait 😯.

  • It's still possible to search with "site:reddit.com ..."

    Has it been implemented yet or are they blocking non-flagged searches? Which seems odd.

    • You shouldn't be getting any new results if you do that; older posts will/may remain indexed.
