Tag: crawler

  • Simon Willison’s Weblog: OpenAI API: Responses vs. Chat Completions

    Source URL: https://simonwillison.net/2025/Mar/11/responses-vs-chat-completions/#atom-everything
    Source: Simon Willison’s Weblog
    Title: OpenAI API: Responses vs. Chat Completions
    Feedly Summary: OpenAI released a bunch of new API platform features this morning under the headline “New tools for building agents” (their somewhat mushy interpretation of “agents” here is “systems that independently accomplish tasks on…

  • Simon Willison’s Weblog: llm-openrouter 0.4

    Source URL: https://simonwillison.net/2025/Mar/10/llm-openrouter-04/
    Source: Simon Willison’s Weblog
    Title: llm-openrouter 0.4
    Feedly Summary: I found out this morning that OpenRouter include support for a number of (rate-limited) free API models. I occasionally run workshops on top of LLMs (like this one) and being able to provide students with a quick way to obtain an API…

  • Hacker News: Moscow-based global news network has infected Western AI tools

    Source URL: https://www.newsguardrealitycheck.com/p/a-well-funded-moscow-based-global
    Source: Hacker News
    Title: Moscow-based global news network has infected Western AI tools
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text discusses a disinformation network, “Pravda,” that is manipulating AI chatbots by flooding them with false narratives and propaganda, resulting in a significant percentage of chatbot outputs containing disinformation.…

  • The Cloudflare Blog: No hallucinations here: track the latest AI trends with expanded insights on Cloudflare Radar

    Source URL: https://blog.cloudflare.com/expanded-ai-insights-on-cloudflare-radar/
    Source: The Cloudflare Blog
    Title: No hallucinations here: track the latest AI trends with expanded insights on Cloudflare Radar
    Feedly Summary: Today, we are launching a new dedicated “AI Insights” page on Cloudflare Radar that incorporates this graph and builds on it with additional metrics.
    AI Summary and Description: Yes
    Short Summary…

  • Hacker News: AI haters build tarpits to trap and trick AI scrapers that ignore robots.txt

    Source URL: https://arstechnica.com/tech-policy/2025/01/ai-haters-build-tarpits-to-trap-and-trick-ai-scrapers-that-ignore-robots-txt/
    Source: Hacker News
    Title: AI haters build tarpits to trap and trick AI scrapers that ignore robots.txt
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text discusses the creation of a new malware named Nepenthes, designed by a software developer to combat AI web crawlers that ignore “no scraping” directives…

  • Slashdot: Developer Creates Infinite Maze That Traps AI Training Bots

    Source URL: https://slashdot.org/story/25/01/23/2135205/developer-creates-infinite-maze-that-traps-ai-training-bots?utm_source=rss1.0mainlinkanon&utm_medium=feed
    Source: Slashdot
    Title: Developer Creates Infinite Maze That Traps AI Training Bots
    AI Summary and Description: Yes
    Summary: The text discusses the development of an open-source program called Nepenthes, designed to trap AI web crawlers in an endless loop of link generation, effectively wasting their resources. This innovative approach provides…

  • The Register: OpenAI’s ChatGPT crawler can be tricked into DDoSing sites, answering your queries

    Source URL: https://www.theregister.com/2025/01/19/openais_chatgpt_crawler_vulnerability/
    Source: The Register
    Title: OpenAI’s ChatGPT crawler can be tricked into DDoSing sites, answering your queries
    Feedly Summary: The S in LLM stands for Security. OpenAI’s ChatGPT crawler appears to be willing to initiate distributed denial of service (DDoS) attacks on arbitrary websites, a reported vulnerability the tech giant has yet to…

  • Hacker News: Nepenthes is a tarpit to catch AI web crawlers

    Source URL: https://zadzmo.org/code/nepenthes/
    Source: Hacker News
    Title: Nepenthes is a tarpit to catch AI web crawlers
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text describes “Nepenthes,” a tarpit software devised to trap web crawlers, particularly those scraping data for large language models (LLMs). It offers unique functionalities and deployment setups, with explicit…