Tag: val

  • Simon Willison’s Weblog: Quoting Adam Gordon Bell

    Source URL: https://simonwillison.net/2025/Jul/3/adam-gordon-bell/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Adam Gordon Bell Feedly Summary: I think that a lot of resistance to AI coding tools comes from the same place: fear of losing something that has defined you for so long. People are reacting against overblown hype, and there is overblown hype. I get that,…

  • The Register: AI models just don’t understand what they’re talking about

    Source URL: https://www.theregister.com/2025/07/03/ai_models_potemkin_understanding/ Source: The Register Title: AI models just don’t understand what they’re talking about Feedly Summary: Researchers find models’ success at tests hides illusion of understanding Researchers from MIT, Harvard, and the University of Chicago have proposed the term “potemkin understanding" to describe a newly identified failure mode in large language models that…

  • Simon Willison’s Weblog: TIL: Rate limiting by IP using Cloudflare’s rate limiting rules

    Source URL: https://simonwillison.net/2025/Jul/3/rate-limiting-by-ip/#atom-everything Source: Simon Willison’s Weblog Title: TIL: Rate limiting by IP using Cloudflare’s rate limiting rules Feedly Summary: TIL: Rate limiting by IP using Cloudflare’s rate limiting rules My blog started timing out on some requests a few days ago, and it turned out there were misbehaving crawlers that were spidering my /search/…

  • Simon Willison’s Weblog: Frequently Asked Questions (And Answers) About AI Evals

    Source URL: https://simonwillison.net/2025/Jul/3/faqs-about-ai-evals/#atom-everything Source: Simon Willison’s Weblog Title: Frequently Asked Questions (And Answers) About AI Evals Feedly Summary: Frequently Asked Questions (And Answers) About AI Evals Hamel Husain and Shreya Shankar have been running a paid, cohort-based course on AI Evals For Engineers & PMs over the past few months. Here Hamel collects answers to…

  • Docker: 5 Best Practices for Building, Testing, and Packaging MCP Servers 

    Source URL: https://www.docker.com/blog/mcp-server-best-practices/ Source: Docker Title: 5 Best Practices for Building, Testing, and Packaging MCP Servers  Feedly Summary: We recently launched a new, reimagined Docker MCP Catalog with improved discovery and a new submission process. Containerized MCP servers offer a secure way to run and scale agentic applications and minimize risks tied to host access…

  • Simon Willison’s Weblog: Sandboxed tools in a loop

    Source URL: https://simonwillison.net/2025/Jul/3/sandboxed-tools-in-a-loop/#atom-everything Source: Simon Willison’s Weblog Title: Sandboxed tools in a loop Feedly Summary: Something I’ve realized about LLM tool use is that it means that if you can reduce a problem to something that can be solved by an LLM in a sandbox using tools in a loop, you can brute force that…

  • CSA: What We Can Learn from the 2024 CrowdStrike Outage

    Source URL: https://cloudsecurityalliance.org/articles/what-we-can-learn-from-the-2024-crowdstrike-outage Source: CSA Title: What We Can Learn from the 2024 CrowdStrike Outage Feedly Summary: AI Summary and Description: Yes **Summary:** The analysis of the CrowdStrike outage in July 2024 highlights significant vulnerabilities within centralized cloud security solutions and their ripple effects on numerous organizations. The incident underscores the critical need for thorough…

  • CSA: What MITRE ATT&CK v17 Means for ESXi Security

    Source URL: https://valicyber.com/resources/mitre-attck-v17-esxi/ Source: CSA Title: What MITRE ATT&CK v17 Means for ESXi Security Feedly Summary: AI Summary and Description: Yes Summary: The article discusses the introduction of the ESXi matrix in MITRE ATT&CK v17, emphasizing its significance for securing hypervisors as critical attack surfaces. It identifies high-risk TTPs (Tactics, Techniques, and Procedures) specific to…