« Back403ing AI Crawlerscoryd.devSubmitted by cdme 6 months ago
  • delifue 6 months ago

    In my opinion, the best way of fighting with crawlers is not giving error feedback (403). The best way is to give the crawlers low-quality AI-generated data.

    • marcus0x62 6 months ago

      Self plug, but I made this to deal with bots on my site: https://marcusb.org/hacks/quixotic.html. It is a simple markov generator to obfuscate content (static-site friendly, no server-side dynamic generation required) and an optional link-maze to send incorrigible bots to 100% markov-generated non-sense (requires a server-side component.)

      I do serve a legit robots.txt file to warn the scrapers I know about away.