Over the course of its 18 years of existence, Reddit has accumulated a veritable treasury of human interactions and conversations. These vast data pools have provided businesses with the ideal environment for training AI chatbots and massive language models. Now that it has decided it wants a piece of the AI pie, Reddit will begin charging businesses for access to its API, which is required to train LLMs.

In any case, these are only some of the kinds of mom-and-pop businesses that would use the API to train AI chatbots. Reddit is used by industry giants such as Google and OpenAI to help provide first guidance to artificial intelligence services that are just starting. To achieve this objective, Reddit is launching what the company describes as a “new premium access point for third parties,” as stated in an official release.
The pricing is not yet finalized at this time. On the other hand, Reddit stated that it would be segmented into tiers, which will most likely support businesses of varying sizes. As points of differentiation between the tiers, the social networking platform highlights many different usage limits and larger usage rights.
“The Reddit corpus of data is really valuable,” said Steve Huffman, the creator, and chief executive officer of Reddit, in an interview with The New York Times. “However, there is no reason for us to give all of that value away for free to some of the largest companies in the world.”

Data scrapers like Common Crawl are frequently utilized as chatbot tutors. Reddit is not the only online depository of material used to train massive language models. On the other hand, Common Crawl and other such businesses deal with raw data, which can be considered enormous pools of web information. Conversations between real people are what you’ll find on Reddit. A well-rounded artificial intelligence needs access to both sorts of data to improve its factual accuracy and more closely mimic human behavior.
Reddit’s application program interface (API) is frequently utilized in developing and maintaining tools for content moderation. Instead of charging content moderators a fee to access the API, the company is developing specialized moderation tools for mobile iOS and Android mobile applications. The applications will come packed with a mod log, rules management tools, information on mod queues, and much more.
Why make this adjustment at this time? It seems like quickly, AI went from being a specialized field to a major industry, and whispers are going around that Reddit is planning to go public later this year. When presenting an initial public offering (IPO), it is almost always a smart idea to set up new revenue streams.

