Tag: performance

  • Simon Willison’s Weblog: Devstral

    Source URL: https://simonwillison.net/2025/May/21/devstral/#atom-everything Source: Simon Willison’s Weblog Title: Devstral Feedly Summary: Devstral New Apache 2.0 licensed LLM release from Mistral, this time specifically trained for code. Devstral achieves a score of 46.8% on SWE-Bench Verified, outperforming prior open-source SoTA models by more than 6% points. When evaluated under the same test scaffold (OpenHands, provided by…

  • Simon Willison’s Weblog: Gemini Diffusion

    Source URL: https://simonwillison.net/2025/May/21/gemini-diffusion/ Source: Simon Willison’s Weblog Title: Gemini Diffusion Feedly Summary: Gemini Diffusion Another of the announcements from Google I/O yesterday was Gemini Diffusion, Google’s first LLM to use diffusion (similar to image models like Imagen and Stable Diffusion) in place of transformers. Google describe it like this: Traditional autoregressive language models generate text…

  • The Register: Microsoft-backed AI out-forecasts hurricane experts without crunching the physics

    Source URL: https://www.theregister.com/2025/05/21/earth_system_model_hurricane_forecast/ Source: The Register Title: Microsoft-backed AI out-forecasts hurricane experts without crunching the physics Feedly Summary: LLM trained on decades of weather data claimed to be faster, and cheaper Scientists have developed a machine learning model that can outperform official agencies at predicting tropical cyclone tracks, and do it faster and cheaper than…

  • Cloud Blog: The AI-driven telecom: A new era of network transformation

    Source URL: https://cloud.google.com/blog/topics/telecommunications/the-ai-driven-telecom-a-new-era-of-network-transformation/ Source: Cloud Blog Title: The AI-driven telecom: A new era of network transformation Feedly Summary: The telecommunications industry is undergoing a profound transformation, with AI and generative AI emerging as key catalysts. Communication service providers (CSPs) are increasingly recognizing that these technologies are not merely incremental improvements but fundamental drivers for achieving…

  • OpenAI : New tools and features in the Responses API

    Source URL: https://openai.com/index/new-tools-and-features-in-the-responses-api Source: OpenAI Title: New tools and features in the Responses API Feedly Summary: New features in the Responses API: Remote MCP, image gen, Code Interpreter, and more. Powering faster, smarter agents with GPT-4o & o-series models, plus new features for reliability and efficiency. AI Summary and Description: Yes Summary: The text discusses…

  • The Register: Nvidia CEO Jensen Huang labels US GPU export bans ‘precisely wrong’ and ‘a failure’

    Source URL: https://www.theregister.com/2025/05/21/jensen_huang_h20_ban_criticism/ Source: The Register Title: Nvidia CEO Jensen Huang labels US GPU export bans ‘precisely wrong’ and ‘a failure’ Feedly Summary: Argues the world needs China’s AI researchers working on his chips so the rest of us benefit Computex Nvidia CEO Jensen Huang has said the USA’s ban on exports of his company’s…

  • Simon Willison’s Weblog: Gemini 2.5: Our most intelligent models are getting even better

    Source URL: https://simonwillison.net/2025/May/20/gemini-25/#atom-everything Source: Simon Willison’s Weblog Title: Gemini 2.5: Our most intelligent models are getting even better Feedly Summary: Gemini 2.5: Our most intelligent models are getting even better A bunch of new Gemini 2.5 announcements at Google I/O today. 2.5 Flash and 2.5 Pro are both getting audio output (previously previewed in Gemini…