Tag: generation

  • Simon Willison’s Weblog: Devstral

    Source URL: https://simonwillison.net/2025/May/21/devstral/#atom-everything Source: Simon Willison’s Weblog Title: Devstral Feedly Summary: New Apache 2.0 licensed LLM release from Mistral, this time specifically trained for code. Devstral achieves a score of 46.8% on SWE-Bench Verified, outperforming prior open-source SoTA models by more than 6% points. When evaluated under the same test scaffold (OpenHands, provided by…

  • Simon Willison’s Weblog: Gemini Diffusion

    Source URL: https://simonwillison.net/2025/May/21/gemini-diffusion/ Source: Simon Willison’s Weblog Title: Gemini Diffusion Feedly Summary: Another of the announcements from Google I/O yesterday was Gemini Diffusion, Google’s first LLM to use diffusion (similar to image models like Imagen and Stable Diffusion) in place of transformers. Google describe it like this: Traditional autoregressive language models generate text…

  • OpenAI : New tools and features in the Responses API

    Source URL: https://openai.com/index/new-tools-and-features-in-the-responses-api Source: OpenAI Title: New tools and features in the Responses API Feedly Summary: New features in the Responses API: Remote MCP, image gen, Code Interpreter, and more. Powering faster, smarter agents with GPT-4o & o-series models, plus new features for reliability and efficiency. AI Summary and Description: Yes Summary: The text discusses…

  • Slashdot: Google Launches Veo 3, an AI Video Generator That Incorporates Audio

    Source URL: https://tech.slashdot.org/story/25/05/20/2042219/google-launches-veo-3-an-ai-video-generator-that-incorporates-audio Source: Slashdot Title: Google Launches Veo 3, an AI Video Generator That Incorporates Audio Feedly Summary: AI Summary and Description: Yes Summary: Google has launched Veo 3, a noteworthy AI video generator that incorporates synchronized audio, alongside Imagen 4 and Flow for image and video creation. These tools emphasize enhancements in user…

  • Simon Willison’s Weblog: Gemini 2.5: Our most intelligent models are getting even better

    Source URL: https://simonwillison.net/2025/May/20/gemini-25/#atom-everything Source: Simon Willison’s Weblog Title: Gemini 2.5: Our most intelligent models are getting even better Feedly Summary: A bunch of new Gemini 2.5 announcements at Google I/O today. 2.5 Flash and 2.5 Pro are both getting audio output (previously previewed in Gemini…

  • Cloud Blog: Expanding Vertex AI with the next wave of generative AI media models

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/announcing-veo-3-imagen-4-and-lyria-2-on-vertex-ai/ Source: Cloud Blog Title: Expanding Vertex AI with the next wave of generative AI media models Feedly Summary: Today, we are introducing the next wave of generative AI media models on Vertex AI: Imagen 4, Veo 3, and Lyria 2. We’ve already seen customers generate stunning, photorealistic images with Imagen 3, Google’s…

  • Cloud Blog: Gemini 2.5 Flash and Pro expand on Vertex AI to drive more sophisticated and secure AI innovation

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/expanding-gemini-2-5-flash-and-pro-capabilities/ Source: Cloud Blog Title: Gemini 2.5 Flash and Pro expand on Vertex AI to drive more sophisticated and secure AI innovation Feedly Summary: Today at Google I/O, we’re expanding Gemini 2.5 Flash and Pro model capabilities that help enterprises build more sophisticated and secure AI-driven applications and agents: Thought summaries: For enterprise-grade…