Tag: multimodal model

  • Simon Willison’s Weblog: Introducing Gemma 3n: The developer guide

    Source URL: https://simonwillison.net/2025/Jun/26/gemma-3n/ Source: Simon Willison’s Weblog Title: Introducing Gemma 3n: The developer guide Feedly Summary: Extremely consequential new open weights model release from Google today: Multimodal by design: Gemma 3n natively supports image, audio, video, and text inputs and text outputs. Optimized for on-device: Engineered with a focus…

  • Simon Willison’s Weblog: Qwen2.5 Omni: See, Hear, Talk, Write, Do It All!

    Source URL: https://simonwillison.net/2025/Apr/28/qwen25-omni/#atom-everything Source: Simon Willison’s Weblog Title: Qwen2.5 Omni: See, Hear, Talk, Write, Do It All! Feedly Summary: I’m not sure how I missed this one at the time, but last month (March 27th) Qwen released their first multi-modal model that can handle audio and…

  • Cloud Blog: Day 1 at Google Cloud Next 25 recap

    Source URL: https://cloud.google.com/blog/topics/google-cloud-next/next25-day-1-recap/ Source: Cloud Blog Title: Day 1 at Google Cloud Next 25 recap Feedly Summary: Hello from Google Cloud Next 25 in Las Vegas! This year, it’s all about how AI can reimagine work and improve our lives — even bringing Hollywood classics like The Wizard of Oz to life on one of…

  • AWS News Blog: AWS announces Pixtral Large 25.02 model in Amazon Bedrock serverless

    Source URL: https://aws.amazon.com/blogs/aws/aws-announces-pixtral-large-25-02-model-in-amazon-bedrock-serverless/ Source: AWS News Blog Title: AWS announces Pixtral Large 25.02 model in Amazon Bedrock serverless Feedly Summary: Mistral AI’s multimodal model, Pixtral Large 25.02, is now available in Amazon Bedrock as a fully managed, serverless offering with cross-Region inference support, multilingual capabilities, and a 128K context window that can process images alongside…

  • Simon Willison’s Weblog: Quoting Ahmed Al-Dahle

    Source URL: https://simonwillison.net/2025/Apr/5/llama-4/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Ahmed Al-Dahle Feedly Summary: The Llama series have been re-designed to use state of the art mixture-of-experts (MoE) architecture and natively trained with multimodality. We’re dropping Llama 4 Scout & Llama 4 Maverick, and previewing Llama 4 Behemoth. 📌 Llama 4 Scout is highest performing small…

  • Cloud Blog: Vertex AI Search and Generative AI (with Gemini) achieve FedRAMP High

    Source URL: https://cloud.google.com/blog/topics/public-sector/vertex-ai-search-and-generative-ai-with-gemini-achieve-fedramp-high/ Source: Cloud Blog Title: Vertex AI Search and Generative AI (with Gemini) achieve FedRAMP High Feedly Summary: In the rapidly evolving AI landscape, security remains paramount. Today, we reinforce that commitment with another significant achievement: FedRAMP High authorization for Google Vertex AI Search and Generative AI on Vertex AI. This follows our announcement…
