Tag: Google Cloud project
-
Cloud Blog: Launching Gemini CLI extensions for Google Data Cloud
Source URL: https://cloud.google.com/blog/products/databases/gemini-cli-extensions-for-google-data-cloud/ Source: Cloud Blog Title: Launching Gemini CLI extensions for Google Data Cloud Feedly Summary: In June, Google introduced Gemini CLI, an open-source AI agent that brings the power of Gemini directly into your terminal. And today, we’re excited to announce open-source Gemini CLI extensions for Google Data Cloud services. Building applications and…
-
Cloud Blog: Five ways Skopeo can simplify your Google Cloud container workflow
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/five-ways-skopeo-can-simplify-your-google-cloud-container-workflow/ Source: Cloud Blog Title: Five ways Skopeo can simplify your Google Cloud container workflow Feedly Summary: Managing container images effectively is crucial for modern application development and deployment, especially in Cloud environments. Popular tools like Docker are commonly used to pull, push, and inspect container images. However, the reliance on a running…
-
Cloud Blog: vLLM Performance Tuning: The Ultimate Guide to xPU Inference Configuration
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/vllm-performance-tuning-the-ultimate-guide-to-xpu-inference-configuration/ Source: Cloud Blog Title: vLLM Performance Tuning: The Ultimate Guide to xPU Inference Configuration Feedly Summary: Additional contributors include Hossein Sarshar, Ashish Narasimham, and Chenyang Li. Large Language Models (LLMs) are revolutionizing how we interact with technology, but serving these powerful models efficiently can be a challenge. vLLM has rapidly become…
-
Cloud Blog: Rightsizing LLM Serving on vLLM for GPUs and TPUs
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/rightsizing-llm-serving-on-vllm-for-gpus-and-tpus/ Source: Cloud Blog Title: Rightsizing LLM Serving on vLLM for GPUs and TPUs Feedly Summary: Additional contributors include Hossein Sarshar and Ashish Narasimham. Large Language Models (LLMs) are revolutionizing how we interact with technology, but serving these powerful models efficiently can be a challenge. vLLM has rapidly become the primary choice for…
-
Cloud Blog: Secure your storage: Best practices to prevent dangling bucket takeovers
Source URL: https://cloud.google.com/blog/products/identity-security/best-practices-to-prevent-dangling-bucket-takeovers/ Source: Cloud Blog Title: Secure your storage: Best practices to prevent dangling bucket takeovers Feedly Summary: Storage buckets are where your data lives in the cloud. Much like digital real estate, these buckets are your own plot of land on the internet. When you move away and no longer need a specific…
-
Cloud Blog: Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/implementing-high-performance-llm-serving-on-gke-an-inference-gateway-walkthrough/ Source: Cloud Blog Title: Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough Feedly Summary: The excitement around open Large Language Models like Gemma, Llama, Mistral, and Qwen is evident, but developers quickly hit a wall. How do you deploy them effectively at scale? Traditional load balancing algorithms fall short, as…
-
Cloud Blog: Enhancing GKE data protection with cross-project backup and restore
Source URL: https://cloud.google.com/blog/products/storage-data-transfer/backup-for-gke-supports-cross-project-backup-and-restore/ Source: Cloud Blog Title: Enhancing GKE data protection with cross-project backup and restore Feedly Summary: As Google Kubernetes Engine (GKE) deployments grow and scale, adopting a multi-project strategy in Google Cloud becomes a best practice for security and environment organization. Creating clear boundaries by using distinct projects for development, testing, and production…
-
Cloud Blog: Announcing Vertex AI Agent Engine Memory Bank available for everyone in preview
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-memory-bank-in-public-preview/ Source: Cloud Blog Title: Announcing Vertex AI Agent Engine Memory Bank available for everyone in preview Feedly Summary: Developers are racing to productize agents, but a common limitation is the absence of memory. Without memory, agents treat each interaction as the first, asking repetitive questions and failing to recall user preferences. This…
-
Cloud Blog: Audit smarter: Introducing Google Cloud’s Recommended AI Controls framework
Source URL: https://cloud.google.com/blog/products/identity-security/audit-smarter-introducing-our-recommended-ai-controls-framework/ Source: Cloud Blog Title: Audit smarter: Introducing Google Cloud’s Recommended AI Controls framework Feedly Summary: As organizations build new generative AI applications and AI agents to automate business workflows, security and risk management management leaders face a new set of governance challenges. The complex, often opaque nature of AI models and agents,…
-
Cloud Blog: Enhancing backup vaults with support for Persistent Disk, Hyperdisk, and multi-regions
Source URL: https://cloud.google.com/blog/products/storage-data-transfer/backup-vaults-add-support-for-disk-backup-and-multi-region/ Source: Cloud Blog Title: Enhancing backup vaults with support for Persistent Disk, Hyperdisk, and multi-regions Feedly Summary: To help protect against evolving digital threats like ransomware and malicious deletions, last year, we introduced backup vault in the Google Cloud Backup and DR service, with support for Compute Engine VM backups. This provided…