Tag: large

  • Simon Willison’s Weblog: GPT-5-Codex

    Source URL: https://simonwillison.net/2025/Sep/23/gpt-5-codex/#atom-everything Source: Simon Willison’s Weblog Title: GPT-5-Codex Feedly Summary: OpenAI half-released this model earlier this month, adding it to their Codex CLI tool but not their API. Today they’ve fixed that – the new model can now be accessed as gpt-5-codex. It’s priced the same as regular GPT-5: $1.25/million input tokens, $10/million…

  • Simon Willison’s Weblog: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action

    Source URL: https://simonwillison.net/2025/Sep/23/qwen3-vl/ Source: Simon Willison’s Weblog Title: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action Feedly Summary: I’ve been looking forward to this. Qwen 2.5 VL is one of the best available open weight vision LLMs, so I had high hopes for Qwen 3’s vision models. Firstly, we…

  • OpenAI: OpenAI, Oracle, and SoftBank expand Stargate with five new AI datacenter sites

    Source URL: https://openai.com/index/five-new-stargate-sites Source: OpenAI Title: OpenAI, Oracle, and SoftBank expand Stargate with five new AI datacenter sites Feedly Summary: OpenAI, Oracle, and SoftBank announce five new Stargate AI datacenter sites, accelerating a $500B, 10-gigawatt U.S. infrastructure buildout to power next-generation AI and create tens of thousands of jobs. AI Summary and Description: Yes Summary:…

  • Slashdot: Journals Infiltrated With ‘Copycat’ Papers That Can Be Written By AI

    Source URL: https://science.slashdot.org/story/25/09/23/1825258/journals-infiltrated-with-copycat-papers-that-can-be-written-by-ai?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Journals Infiltrated With ‘Copycat’ Papers That Can Be Written By AI Feedly Summary: AI Summary and Description: Yes Summary: The text discusses a significant concern regarding the misuse of text-generating AI tools, such as ChatGPT and Gemini, in rewriting scientific papers and producing fraudulent research. This highlights the potential…

  • Cloud Blog: AI Innovators: How JAX on TPU is helping Escalante advance AI-driven protein design

    Source URL: https://cloud.google.com/blog/topics/customers/escalante-uses-jax-on-tpus-for-ai-driven-protein-design/ Source: Cloud Blog Title: AI Innovators: How JAX on TPU is helping Escalante advance AI-driven protein design Feedly Summary: As a Python library for accelerator-oriented array computation and program transformation, JAX is widely recognized for its power in training large-scale AI models. But its core design as a system for composable function…

  • Simon Willison’s Weblog: Why AI systems might never be secure

    Source URL: https://simonwillison.net/2025/Sep/23/why-ai-systems-might-never-be-secure/#atom-everything Source: Simon Willison’s Weblog Title: Why AI systems might never be secure Feedly Summary: The Economist have a new piece out about LLM security, with this headline and subtitle: Why AI systems might never be secure A “lethal trifecta” of conditions opens them to abuse…

  • The Register: AI gone rogue: Models may try to stop people from shutting them down, Google warns

    Source URL: https://www.theregister.com/2025/09/22/google_ai_misalignment_risk/ Source: The Register Title: AI gone rogue: Models may try to stop people from shutting them down, Google warns Feedly Summary: Misalignment risk? That’s an area for future study. Google DeepMind added a new AI threat scenario – one where a model might try to prevent its operators from modifying it or…

  • Simon Willison’s Weblog: Four new releases from Qwen

    Source URL: https://simonwillison.net/2025/Sep/22/qwen/ Source: Simon Willison’s Weblog Title: Four new releases from Qwen Feedly Summary: It’s been an extremely busy day for team Qwen. Within the last 24 hours (all links to Twitter, which seems to be their preferred platform for these announcements): Qwen3-Next-80B-A3B-Instruct-FP8 and Qwen3-Next-80B-A3B-Thinking-FP8 – official FP8 quantized versions of their Qwen3-Next models.…