Skip to content

Commit

Permalink
AI my ass, fku
Browse files Browse the repository at this point in the history
  • Loading branch information
filmaj committed Jun 28, 2024
1 parent 1f3a247 commit 16a8066
Showing 1 changed file with 30 additions and 0 deletions.
30 changes: 30 additions & 0 deletions robots.txt
Original file line number Diff line number Diff line change
@@ -1,2 +1,32 @@
User-agent: GPTBot
Disallow: /

## Meta’s bot that crawls public web pages to improve language models for their speech recognition technology.
User-agent: FacebookBot
Disallow: /

## Apple very kindly told us how to block their scraper AFTER they'd scraped everything.
User-agent: Applebot-Extended
Disallow: /

## is used by used by Anthropic to gather data for their “AI” products, such as Claude
User-agent: anthropic-ai
Disallow: /

## is another agent used by Anthropic that is more specifically related to Claude
User-agent: ClaudeBot
Disallow: /

# is a somewhat dishonest scraping bot used to collect data to train LLMs. This is their default user-agent, but they make it easy for their clients to change it to something else and ignore your wishes
User-agent: Diffbot
Disallow: /

## This is just getting stupid and I hope governments step in to wreck these tech-bro thieves.
User-agent: Bytespider
Disallow: /
User-agent: ImagesiftBot
Disallow: /
User-agent: PerplexityBot
Disallow: /
User-agent: cohere-ai
Disallow: /

0 comments on commit 16a8066

Please sign in to comment.