Tag: modalities
-
Cloud Blog: How to build a real-time voice agent with Gemini, Google ADK, and A2A protocol
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/build-a-real-time-voice-agent-with-gemini-adk/ Source: Cloud Blog Title: How to build a real-time voice agent with Gemini, Google ADK, and A2A protocol Feedly Summary: Building advanced conversational AI has moved well beyond text. Now, we can use AI to create real-time, voice-driven agents. However, these systems need low-latency, two-way communication, real-time information retrieval, and the ability…
-
AWS News Blog: Announcing Amazon Nova customization in Amazon SageMaker AI
Source URL: https://aws.amazon.com/blogs/aws/announcing-amazon-nova-customization-in-amazon-sagemaker-ai/ Source: AWS News Blog Title: Announcing Amazon Nova customization in Amazon SageMaker AI Feedly Summary: AWS now enables extensive customization of Amazon Nova foundation models through SageMaker AI with techniques including continued pre-training, supervised fine-tuning, direct preference optimization, reinforcement learning from human feedback and model distillation to better address domain-specific requirements across…
-
Gemini: Hear a podcast discussion about Gemini’s multimodal capabilities.
Source URL: https://blog.google/products/gemini/release-notes-podcast-gemini-multimodal/ Source: Gemini Title: Hear a podcast discussion about Gemini’s multimodal capabilities. Feedly Summary: The latest episode of the Google AI: Release Notes podcast focuses on how Gemini was built from the ground up as a multimodal model — meaning a model that works with tex… AI Summary and Description: Yes Summary: The…