Tag: web crawlers

  • Slashdot: Developer Creates Infinite Maze That Traps AI Training Bots

    Source URL: https://slashdot.org/story/25/01/23/2135205/developer-creates-infinite-maze-that-traps-ai-training-bots?utm_source=rss1.0mainlinkanon&utm_medium=feed
    Source: Slashdot
    Summary: The text discusses the development of an open-source program called Nepenthes, designed to trap AI web crawlers in an endless loop of link generation, effectively wasting their resources. This innovative approach provides…

  • Hacker News: Nepenthes is a tarpit to catch AI web crawlers

    Source URL: https://zadzmo.org/code/nepenthes/
    Source: Hacker News
    Summary: The text describes “Nepenthes,” a tarpit software devised to trap web crawlers, particularly those scraping data for large language models (LLMs). It offers unique functionalities and deployment setups, with explicit…

  • Wired: The Race to Block OpenAI’s Scraping Bots Is Slowing Down

    Source URL: https://www.wired.com/story/open-ai-publisher-deals-scraping-bots/
    Source: Wired
    Feedly Summary: OpenAI’s spree of licensing agreements is paying off already, at least in terms of getting publishers to lower their guard.
    Summary: The text examines the evolving relationship between AI companies, particularly OpenAI, and…

  • Hacker News: Major Sites Are Saying No to Apple’s AI Scraping

    Source URL: https://www.wired.com/story/applebot-extended-apple-ai-scraping/
    Source: Hacker News
    Summary: The article discusses Apple’s introduction of a tool, Applebot-Extended, which allows publishers to opt out of data usage for AI training. This change signals a shift in attitudes towards web…