Tag: robots

  • The Cloudflare Blog: Perplexity is using stealth, undeclared crawlers to evade website no-crawl directives

    Source URL: https://blog.cloudflare.com/perplexity-is-using-stealth-undeclared-crawlers-to-evade-website-no-crawl-directives/ Source: The Cloudflare Blog Title: Perplexity is using stealth, undeclared crawlers to evade website no-crawl directives Feedly Summary: Perplexity is repeatedly modifying their user agent and changing IPs and ASNs to hide their crawling activity, in direct conflict with explicit no-crawl preferences expressed by websites. AI Summary and Description: Yes Summary: The…

  • Slashdot: Perplexity is Using Stealth, Undeclared Crawlers To Evade Website No-Crawl Directives, Cloudflare Says

    Source URL: https://tech.slashdot.org/story/25/08/04/1459240/perplexity-is-using-stealth-undeclared-crawlers-to-evade-website-no-crawl-directives-cloudflare-says?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Perplexity is Using Stealth, Undeclared Crawlers To Evade Website No-Crawl Directives, Cloudflare Says Feedly Summary: AI Summary and Description: Yes Summary: The report highlights ethical concerns regarding the web crawling practices of the AI startup Perplexity. By using undetected methods to bypass website restrictions on automated access, this behavior…

  • Tomasz Tunguz: The Rise of the Agent Manager

    Source URL: https://www.tomtunguz.com/how-many-agents-can-you-manage/ Source: Tomasz Tunguz Title: The Rise of the Agent Manager Feedly Summary: If 2025 is the year of agents, then 2026 will surely belong to agent managers. Agent managers are people who can manage teams of AI agents. How many can one person successfully manage? I can barely manage 4 AI agents…

  • Simon Willison’s Weblog: TIL: Rate limiting by IP using Cloudflare’s rate limiting rules

    Source URL: https://simonwillison.net/2025/Jul/3/rate-limiting-by-ip/#atom-everything Source: Simon Willison’s Weblog Title: TIL: Rate limiting by IP using Cloudflare’s rate limiting rules Feedly Summary: TIL: Rate limiting by IP using Cloudflare’s rate limiting rules My blog started timing out on some requests a few days ago, and it turned out there were misbehaving crawlers that were spidering my /search/…

  • The Cloudflare Blog: From Googlebot to GPTBot: who’s crawling your site in 2025

    Source URL: https://blog.cloudflare.com/from-googlebot-to-gptbot-whos-crawling-your-site-in-2025/ Source: The Cloudflare Blog Title: From Googlebot to GPTBot: who’s crawling your site in 2025 Feedly Summary: From May 2024 to May 2025, crawler traffic rose 18%, with GPTBot growing 305% and Googlebot 96%. AI Summary and Description: Yes Summary: The text discusses the evolution of web crawlers, particularly focusing on the…

  • The Cloudflare Blog: Control content use for AI training with Cloudflare’s managed robots.txt and blocking for monetized content

    Source URL: https://blog.cloudflare.com/control-content-use-for-ai-training/ Source: The Cloudflare Blog Title: Control content use for AI training with Cloudflare’s managed robots.txt and blocking for monetized content Feedly Summary: Cloudflare is making it easier for publishers and content creators of all sizes to prevent their content from being scraped for AI training by managing robots.txt on their behalf. AI…

  • CSA: Navigating Cybersecurity in Indian Manufacturing

    Source URL: https://cloudsecurityalliance.org/articles/navigating-the-cybersecurity-imperative-in-india-s-manufacturing-ascent Source: CSA Title: Navigating Cybersecurity in Indian Manufacturing Feedly Summary: AI Summary and Description: Yes Summary: The text emphasizes the urgent need for enhanced cybersecurity measures in India’s manufacturing sector due to increased automation and the rise of cyber threats. It highlights the shift from viewing cybersecurity as a backend function to…

  • Slashdot: Google Rolls Out New Gemini Model That Can Run On Robots Locally

    Source URL: https://hardware.slashdot.org/story/25/06/24/2150256/google-rolls-out-new-gemini-model-that-can-run-on-robots-locally?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Google Rolls Out New Gemini Model That Can Run On Robots Locally Feedly Summary: AI Summary and Description: Yes Summary: Google DeepMind has introduced Gemini Robotics On-Device, an advanced language model allowing robots to execute complex tasks locally without needing internet access. This development is significant for AI security…

  • Slashdot: SoftBank’s Son Pitches $1 Trillion Arizona AI Hub

    Source URL: https://news.slashdot.org/story/25/06/20/2212217/softbanks-son-pitches-1-trillion-arizona-ai-hub?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: SoftBank’s Son Pitches $1 Trillion Arizona AI Hub Feedly Summary: AI Summary and Description: Yes Summary: The text discusses SoftBank Group’s founder Masayoshi Son’s initiative to create a $1 trillion industrial complex in Arizona focusing on robotics and artificial intelligence manufacturing. This venture is tied to discussions with TSMC…