Tag: Google Cloud project
-
Cloud Blog: Five ways Skopeo can simplify your Google Cloud container workflow
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/five-ways-skopeo-can-simplify-your-google-cloud-container-workflow/ Source: Cloud Blog Title: Five ways Skopeo can simplify your Google Cloud container workflow Feedly Summary: Managing container images effectively is crucial for modern application development and deployment, especially in cloud environments. Popular tools like Docker are commonly used to pull, push, and inspect container images. However, the reliance on a running…
-
Cloud Blog: vLLM Performance Tuning: The Ultimate Guide to xPU Inference Configuration
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/vllm-performance-tuning-the-ultimate-guide-to-xpu-inference-configuration/ Source: Cloud Blog Title: vLLM Performance Tuning: The Ultimate Guide to xPU Inference Configuration Feedly Summary: Additional contributors include Hossein Sarshar, Ashish Narasimham, and Chenyang Li. Large Language Models (LLMs) are revolutionizing how we interact with technology, but serving these powerful models efficiently can be a challenge. vLLM has rapidly become…
-
Cloud Blog: Rightsizing LLM Serving on vLLM for GPUs and TPUs
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/rightsizing-llm-serving-on-vllm-for-gpus-and-tpus/ Source: Cloud Blog Title: Rightsizing LLM Serving on vLLM for GPUs and TPUs Feedly Summary: Additional contributors include Hossein Sarshar and Ashish Narasimham. Large Language Models (LLMs) are revolutionizing how we interact with technology, but serving these powerful models efficiently can be a challenge. vLLM has rapidly become the primary choice for…
-
Cloud Blog: Secure your storage: Best practices to prevent dangling bucket takeovers
Source URL: https://cloud.google.com/blog/products/identity-security/best-practices-to-prevent-dangling-bucket-takeovers/ Source: Cloud Blog Title: Secure your storage: Best practices to prevent dangling bucket takeovers Feedly Summary: Storage buckets are where your data lives in the cloud. Much like digital real estate, these buckets are your own plot of land on the internet. When you move away and no longer need a specific…
-
Cloud Blog: Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/implementing-high-performance-llm-serving-on-gke-an-inference-gateway-walkthrough/ Source: Cloud Blog Title: Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough Feedly Summary: The excitement around open Large Language Models like Gemma, Llama, Mistral, and Qwen is evident, but developers quickly hit a wall. How do you deploy them effectively at scale? Traditional load balancing algorithms fall short, as…
-
Cloud Blog: Enhancing GKE data protection with cross-project backup and restore
Source URL: https://cloud.google.com/blog/products/storage-data-transfer/backup-for-gke-supports-cross-project-backup-and-restore/ Source: Cloud Blog Title: Enhancing GKE data protection with cross-project backup and restore Feedly Summary: As Google Kubernetes Engine (GKE) deployments grow and scale, adopting a multi-project strategy in Google Cloud becomes a best practice for security and environment organization. Creating clear boundaries by using distinct projects for development, testing, and production…
-
Cloud Blog: Announcing Vertex AI Agent Engine Memory Bank available for everyone in preview
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-memory-bank-in-public-preview/ Source: Cloud Blog Title: Announcing Vertex AI Agent Engine Memory Bank available for everyone in preview Feedly Summary: Developers are racing to productize agents, but a common limitation is the absence of memory. Without memory, agents treat each interaction as the first, asking repetitive questions and failing to recall user preferences. This…
-
Cloud Blog: Audit smarter: Introducing Google Cloud’s Recommended AI Controls framework
Source URL: https://cloud.google.com/blog/products/identity-security/audit-smarter-introducing-our-recommended-ai-controls-framework/ Source: Cloud Blog Title: Audit smarter: Introducing Google Cloud’s Recommended AI Controls framework Feedly Summary: As organizations build new generative AI applications and AI agents to automate business workflows, security and risk management leaders face a new set of governance challenges. The complex, often opaque nature of AI models and agents,…
-
Cloud Blog: Enhancing backup vaults with support for Persistent Disk, Hyperdisk, and multi-regions
Source URL: https://cloud.google.com/blog/products/storage-data-transfer/backup-vaults-add-support-for-disk-backup-and-multi-region/ Source: Cloud Blog Title: Enhancing backup vaults with support for Persistent Disk, Hyperdisk, and multi-regions Feedly Summary: To help protect against evolving digital threats like ransomware and malicious deletions, last year, we introduced backup vault in the Google Cloud Backup and DR service, with support for Compute Engine VM backups. This provided…
-
Cloud Blog: Simplify your streaming pipelines with new Pub/Sub Single Message Transforms
Source URL: https://cloud.google.com/blog/products/data-analytics/pub-sub-single-message-transforms/ Source: Cloud Blog Title: Simplify your streaming pipelines with new Pub/Sub Single Message Transforms Feedly Summary: Today, we’re introducing Pub/Sub Single Message Transforms (SMTs) to make it easy to perform simple data transformations right within Pub/Sub itself. This comes at a time when businesses are increasingly reliant on streaming data to derive…