Tag: robustness
-
Simon Willison’s Weblog: Trading Inference-Time Compute for Adversarial Robustness
Source URL: https://simonwillison.net/2025/Jan/22/trading-inference-time-compute/ Source: Simon Willison’s Weblog Title: Trading Inference-Time Compute for Adversarial Robustness Feedly Summary: Trading Inference-Time Compute for Adversarial Robustness Brand new research paper from OpenAI, exploring how inference-scaling “reasoning" models such as o1 might impact the search for improved security with respect to things like prompt injection. We conduct experiments on the…
-
OpenAI : Trading inference-time compute for adversarial robustness
Source URL: https://openai.com/index/trading-inference-time-compute-for-adversarial-robustness Source: OpenAI Title: Trading inference-time compute for adversarial robustness Feedly Summary: Trading Inference-Time Compute for Adversarial Robustness AI Summary and Description: Yes Summary: The text explores the trade-offs between inference-time computing demands and adversarial robustness within AI systems, particularly relevant in the context of machine learning and AI security. This topic holds…
-
Hacker News: Skyvern Browser Agent 2.0: How We Reached State of the Art in Evals
Source URL: https://blog.skyvern.com/skyvern-2-0-state-of-the-art-web-navigation-with-85-8-on-webvoyager-eval/ Source: Hacker News Title: Skyvern Browser Agent 2.0: How We Reached State of the Art in Evals Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the launch of Skyvern 2.0, an advanced autonomous web agent that achieves a benchmark score of 85.85% on the WebVoyager Eval. It details…
-
Simon Willison’s Weblog: Quoting gwern
Source URL: https://simonwillison.net/2025/Jan/16/gwern/#atom-everything Source: Simon Willison’s Weblog Title: Quoting gwern Feedly Summary: […] much of the point of a model like o1 is not to deploy it, but to generate training data for the next model. Every problem that an o1 solves is now a training data point for an o3 (eg. any o1 session…
-
Hacker News: The Canva outage: another tale of saturation and resilience
Source URL: https://surfingcomplexity.blog/2024/12/21/the-canva-outage-another-tale-of-saturation-and-resilience/ Source: Hacker News Title: The Canva outage: another tale of saturation and resilience Feedly Summary: Comments AI Summary and Description: Yes Summary: The incident at Canva, detailed by Brendan Humphries, highlights a series of interconnected failures that led to a significant service outage. Notably, a CDN misconfiguration and an API gateway performance…
-
The Register: To save the energy grid from AI, use open source AI, says open source body
Source URL: https://www.theregister.com/2025/01/09/linux_foundation_ai_energy_report/ Source: The Register Title: To save the energy grid from AI, use open source AI, says open source body Feedly Summary: Linux Foundation Energy argues rapidly decentralizing electricity sector can’t succeed with silos The energy industry needs to adopt open source AI software, and the collaborative processes used to create it, to…