Tag: Aria

  • Simon Willison’s Weblog: GPT‑5-Codex and upgrades to Codex

    Source URL: https://simonwillison.net/2025/Sep/15/gpt-5-codex/#atom-everything
    Source: Simon Willison’s Weblog
    Title: GPT‑5-Codex and upgrades to Codex
    Feedly Summary: OpenAI half-released a new model today: GPT‑5-Codex, a fine-tuned GPT-5 variant explicitly designed for their various AI-assisted programming tools. I say half-released because it’s not yet available via their API, but they “plan to make…

  • OpenAI: Addendum to GPT-5 system card: GPT-5-Codex

    Source URL: https://openai.com/index/gpt-5-system-card-addendum-gpt-5-codex
    Source: OpenAI
    Title: Addendum to GPT-5 system card: GPT-5-Codex
    Feedly Summary: This addendum to the GPT-5 system card shares a new model: GPT-5-Codex, a version of GPT-5 further optimized for agentic coding in Codex. GPT-5-Codex adjusts its thinking effort more dynamically based on task complexity, responding quickly to simple conversational queries or…

  • The Register: Spectre haunts CPUs again: VMSCAPE vulnerability leaks cloud secrets

    Source URL: https://www.theregister.com/2025/09/11/vmscape_spectre_vulnerability/
    Source: The Register
    Title: Spectre haunts CPUs again: VMSCAPE vulnerability leaks cloud secrets
    Feedly Summary: AMD Zen hardware and Intel Coffee Lake affected. If you thought the world was done with side-channel CPU attacks, think again. ETH Zurich has identified yet another Spectre-based transient execution vulnerability that affects AMD Zen CPUs and…

  • Simon Willison’s Weblog: Defeating Nondeterminism in LLM Inference

    Source URL: https://simonwillison.net/2025/Sep/11/defeating-nondeterminism/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Defeating Nondeterminism in LLM Inference
    Feedly Summary: A very common question I see about LLMs concerns why they can’t be made to deliver the same response to the same prompt by setting a fixed random number seed. Like many others I had…

  • Cloud Blog: Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer

    Source URL: https://cloud.google.com/blog/products/compute/ai-inference-recipe-using-nvidia-dynamo-with-ai-hypercomputer/
    Source: Cloud Blog
    Title: Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer
    Feedly Summary: As generative AI becomes more widespread, it’s important for developers and ML engineers to be able to easily configure infrastructure that supports efficient AI inference, i.e., using a trained AI model to make…

  • Scott Logic: Reading The Gentle Singularity Through a Sustainability Lens

    Source URL: https://blog.scottlogic.com/2025/09/09/reading-the-gentle-singularity-through-a-sustainability-lens.html
    Source: Scott Logic
    Title: Reading The Gentle Singularity Through a Sustainability Lens
    Feedly Summary: Sam Altman says AI will cost as little as electricity, but what’s the real environmental impact of billions of queries a day? In this post I dig into the numbers from the blog post “The Gentle Singularity” and compare…