SEO (jam+/htt) | | 18 |
web-almanac, studies, research, metrics, seo, metadata, structured-data, amp, internationalization |
Why I Don’t Block AI Scrapers (j9t) | | 17 |
ai, scraping |
Google Quietly Launches New AI Crawler (mar/sea) | | 16 |
google, ai, crawling |
Websites Are Blocking the Wrong AI Scrapers (Because AI Companies Keep Making New Ones) (jas/404) | | 15 |
ai, scraping |
The Backlash Against AI Scraping Is Real and Measurable (jas/404) | | 14 |
ai, scraping |
AI Unplugged: Rise (and Fall) of the Robots(.txt) | | 13 |
ai, scraping |
Investigating Reddit’s robots.txt Cloaking Strategy (rya/mer) | | 12 |
scraping, web |
AI Companies Ignoring robots.txt (mjt) | | 11 |
ai, scraping |
Go Ahead and Block AI Web Crawlers (cor) | | 10 |
crawling, scraping, ai |
The Text File That Runs the Internet (dav/ver) | | 9 |
crawling, scraping, ai, web |
Dark Visitors (ghk) | | 8 |
websites, ai, scraping |
Crawlers (ada) | | 7 |
crawling, ai |
Block the Bots That Feed “AI” Models by Scraping Your Website (cla) | | 6 |
scraping, ai |
OpenAI Launches Web Crawling GPTBot, Sparking Blocking Effort by Website Owners and Creators (ven) | | 5 |
ai, openai, crawling, scraping |
Titles, “meta” Tags, “link” Tags, and Search Engine Robots | | 4 |
html, metadata, seo |
robots.txt Validator (Logeix) | | 3 |
tools, analysis, conformance, seo |
robots.txt Generator (nin) | | 2 |
tools, exploration, code-generation, seo |
robots.txt Validator (Merkle) (max/mer) | | 1 |
tools, analysis, conformance, seo |