Executive Director at Common Crawl
Publicly made the case that AI models should be able to train on Common Crawl's web archive data.
How media typically covers Rich Skrenta
Referenced in coverage
Common Crawl Foundation is providing paywalled news articles from major publishers to AI companies like OpenAI and Google for model training while publicly claiming it only scrapes freely available content and misrepresenting compliance with publisher removal requests.
“Publicly made the case that AI models should be able to train on Common Crawl's web archive data.”