Tag: Gemini 2.0
-
Cloud Blog: How to use Gemini 2.5 to fine-tune video outputs on Vertex AI
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/how-to-fine-tune-video-outputs-using-vertex-ai/ Source: Cloud Blog Title: How to use Gemini 2.5 to fine-tune video outputs on Vertex AI Feedly Summary: Recently, we announced Gemini 2.5 is generally available on Vertex AI. As part of this update, tuning capabilities have extended beyond text outputs – now, you can tune image, audio, and video outputs on…
-
Cloud Blog: Multimodal agents tutorial: How to use Gemini, Langchain, and LangGraph to build agents for object detection
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/build-multimodal-agents-using-gemini-langchain-and-langgraph/ Source: Cloud Blog Title: Multimodal agents tutorial: How to use Gemini, Langchain, and LangGraph to build agents for object detection Feedly Summary: Here’s a common scenario when building AI agents that might feel confusing: How can you use the latest Gemini models and an open-source framework like LangChain and LangGraph to create…
-
Simon Willison’s Weblog: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM
Source URL: https://simonwillison.net/2025/May/31/snitchbench-with-llm/#atom-everything Source: Simon Willison’s Weblog Title: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM Feedly Summary: A fun new benchmark just dropped! Inspired by the Claude 4 system card – which showed that Claude 4 might just rat you out to the authorities if you told it to “take initiative" in…
-
Simon Willison’s Weblog: Gemini 2.5: Our most intelligent models are getting even better
Source URL: https://simonwillison.net/2025/May/20/gemini-25/#atom-everything Source: Simon Willison’s Weblog Title: Gemini 2.5: Our most intelligent models are getting even better Feedly Summary: Gemini 2.5: Our most intelligent models are getting even better A bunch of new Gemini 2.5 announcements at Google I/O today. 2.5 Flash and 2.5 Pro are both getting audio output (previously previewed in Gemini…
-
Simon Willison’s Weblog: Create and edit images with Gemini 2.0 in preview
Source URL: https://simonwillison.net/2025/May/7/gemini-images-preview/#atom-everything Source: Simon Willison’s Weblog Title: Create and edit images with Gemini 2.0 in preview Feedly Summary: Create and edit images with Gemini 2.0 in preview Gemini 2.0 Flash has had image generation capabilities for a while now, and they’re now available via the paid Gemini API – at 3.9 cents per generated…
-
Cloud Blog: Build live voice-driven agentic applications with Vertex AI Gemini Live API
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/build-voice-driven-applications-with-live-api/ Source: Cloud Blog Title: Build live voice-driven agentic applications with Vertex AI Gemini Live API Feedly Summary: Across industries, enterprises need efficient and proactive solutions. Imagine frontline professionals using voice commands and visual input to diagnose issues, access vital information, and initiate processes in real-time. The Gemini 2.0 Flash Live API empowers…
-
Simon Willison’s Weblog: Start building with Gemini 2.5 Flash
Source URL: https://simonwillison.net/2025/Apr/17/start-building-with-gemini-25-flash/ Source: Simon Willison’s Weblog Title: Start building with Gemini 2.5 Flash Feedly Summary: Start building with Gemini 2.5 Flash Google Gemini’s latest model is Gemini 2.5 Flash, available in (paid) preview as gemini-2.5-flash-preview-04-17. Building upon the popular foundation of 2.0 Flash, this new version delivers a major upgrade in reasoning capabilities, while…
-
Simon Willison’s Weblog: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet
Source URL: https://simonwillison.net/2025/Apr/14/gpt-4-1/ Source: Simon Willison’s Weblog Title: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet Feedly Summary: OpenAI introduced three new models this morning: GPT-4.1, GPT-4.1 mini and GPT-4.1 nano. These are API-only models right now, not available through the ChatGPT interface (though you can try them out…
-
Cloud Blog: Enabling global scientific discovery and innovation on Google Cloud
Source URL: https://cloud.google.com/blog/topics/hpc/powering-scientific-discovery-with-google-cloud/ Source: Cloud Blog Title: Enabling global scientific discovery and innovation on Google Cloud Feedly Summary: From unraveling the mysteries of our planet and the universe, to accelerating medical research and industrial innovation, scientific discovery impacts nearly every facet of human life. Today, scientific progress depends on the interplay of theory, experimentation, and…
-
Cloud Blog: Introducing Firebase Studio and agentic developer tools to build with Gemini
Source URL: https://cloud.google.com/blog/products/application-development/firebase-studio-lets-you-build-full-stack-ai-apps-with-gemini/ Source: Cloud Blog Title: Introducing Firebase Studio and agentic developer tools to build with Gemini Feedly Summary: Millions of developers use Firebase to engage their users, powering over 70 billion instances of apps every day, everywhere — from mobile devices and web browsers, to embedded platforms and agentic experiences. But full-stack development…