Reddit has initiated legal proceedings against companies alleged to have scraped user-generated data from its platform without authorization. The lawsuit centers on the use of Reddit’s publicly accessible content to train large language models (LLMs), which Reddit claims violates its API terms of service and intellectual property rights.
Reddit revised its API terms in 2023, introducing monetization requirements for entities using its data for commercial purposes, particularly AI training. These changes followed increased interest from AI developers in using Reddit’s diverse, real-world data to improve LLM performance.
Despite these policy updates, Reddit asserts that some companies continued to extract content from its platform without API access or a commercial license, effectively bypassing paywalls and violating terms of service.
Legal Basis of Reddit’s Claim
Reddit’s legal position is based on:
- Breach of Contract: Violating its API Terms of Service.
- Circumvention of Access Controls: Potential breaches under the U.S. Computer Fraud and Abuse Act (CFAA).
- Unfair Commercial Use: Exploiting user-generated data for AI model training without consent or compensation.
Broader Implications
This case intersects with unresolved issues in AI law and data rights:
- Fair Use Doctrine: AI firms may argue public data is subject to fair use. U.S. courts have yet to establish clear boundaries for AI training under fair use.
- Platform Liability: Raises questions about whether platforms like Reddit can claim ownership over user-generated content and restrict downstream use.
- User Privacy: Even anonymized content may carry sensitive information, raising GDPR and consumer protection concerns globally.
Relevance to Canadian and Global Regulation
With Canada’s proposed Artificial Intelligence and Data Act (AIDA) and similar international AI regulations under development, the Reddit case underscores the need for clearer definitions of:
- Lawful data collection practices
- Consent for AI training
- Platform vs. user content ownership
As regulatory frameworks evolve, lawsuits like Reddit’s are expected to shape future compliance expectations for AI developers and content-hosting platforms.
Source:
Reddit API Terms of Service, 2024; U.S. Federal Court Filings (2025)