Tag: node pool
-
Cloud Blog: Meet the new GKE: Extending Autopilot to all qualifying clusters
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/gke-autopilot-now-available-to-all-qualifying-clusters/
Source: Cloud Blog
Title: Meet the new GKE: Extending Autopilot to all qualifying clusters
Feedly Summary: Autopilot is an operational mode for Google Kubernetes Engine (GKE) that provides a fully managed environment and takes care of operational details, like provisioning compute capacity for your workloads. Autopilot allows you to spend more time…
-
Cloud Blog: Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer
Source URL: https://cloud.google.com/blog/products/compute/ai-inference-recipe-using-nvidia-dynamo-with-ai-hypercomputer/
Source: Cloud Blog
Title: Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer
Feedly Summary: As generative AI becomes more widespread, it’s important for developers and ML engineers to be able to easily configure infrastructure that supports efficient AI inference, i.e., using a trained AI model to make…
-
Cloud Blog: From clicks to clusters: Expanding Confidential Computing with Intel TDX
Source URL: https://cloud.google.com/blog/products/identity-security/from-clicks-to-clusters-confidential-computing-expands-with-intel-tdx/
Source: Cloud Blog
Title: From clicks to clusters: Expanding Confidential Computing with Intel TDX
Feedly Summary: Privacy-protecting Confidential Computing has come a long way since we introduced Confidential Virtual Machines (VMs) five years ago. The technology, which can protect data while in use, strengthens a security gap beyond data encryption at rest…
-
Cloud Blog: Five ways Skopeo can simplify your Google Cloud container workflow
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/five-ways-skopeo-can-simplify-your-google-cloud-container-workflow/
Source: Cloud Blog
Title: Five ways Skopeo can simplify your Google Cloud container workflow
Feedly Summary: Managing container images effectively is crucial for modern application development and deployment, especially in Cloud environments. Popular tools like Docker are commonly used to pull, push, and inspect container images. However, the reliance on a running…
-
Cloud Blog: Designing a multi-tenant GKE platform for Yahoo Mail’s migration journey
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/understanding-yahoo-mails-multi-tenant-gke-platform-design/
Source: Cloud Blog
Title: Designing a multi-tenant GKE platform for Yahoo Mail’s migration journey
Feedly Summary: Yahoo is in the midst of a multi-year journey to migrate its renowned Yahoo Mail application onto Google Cloud. With more than 100 services and middleware components in the application, Yahoo Mail is primarily taking a…
-
Cloud Blog: Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/implementing-high-performance-llm-serving-on-gke-an-inference-gateway-walkthrough/
Source: Cloud Blog
Title: Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough
Feedly Summary: The excitement around open Large Language Models like Gemma, Llama, Mistral, and Qwen is evident, but developers quickly hit a wall. How do you deploy them effectively at scale? Traditional load balancing algorithms fall short, as…
-
Cloud Blog: Expanding Z3 family with 9 new VMs and a bare metal instance for storage and I/O intensive workloads
Source URL: https://cloud.google.com/blog/products/compute/expanded-z3-vm-portfolio-for-io-intensive-workloads/
Source: Cloud Blog
Title: Expanding Z3 family with 9 new VMs and a bare metal instance for storage and I/O intensive workloads
Feedly Summary: Today, we are thrilled to announce the expansion of the Z3 Storage Optimized VM family with the general availability of nine new Z3 virtual machines that offer local…
-
Cloud Blog: GKE workload scheduling: Strategies for when resources get tight
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/gke-features-to-optimize-resource-allocation/
Source: Cloud Blog
Title: GKE workload scheduling: Strategies for when resources get tight
Feedly Summary: As a customer of Google Kubernetes Engine (GKE), you’ve selected a container runtime with a high degree of managed operations, encompassing everything from automatic upgrades to effortless node management. This inherent efficiency allows you to focus more…
-
Cloud Blog: Supercharge data access performance with GKE Data Cache
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/gke-data-cache-now-ga-accelerates-stateful-apps/
Source: Cloud Blog
Title: Supercharge data access performance with GKE Data Cache
Feedly Summary: Today, we’re excited to announce the general availability (GA) of GKE Data Cache, a powerful new solution for Google Kubernetes Engine to accelerate the performance of read-heavy stateful or stateless applications that rely on persistent storage via network…
-
Cloud Blog: Accelerating AI in healthcare using NVIDIA BioNeMo Framework and Blueprints on GKE
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/accelerate-ai-in-healthcare-nvidia-bionemo-gke/
Source: Cloud Blog
Title: Accelerating AI in healthcare using NVIDIA BioNeMo Framework and Blueprints on GKE
Feedly Summary: The quest to develop new medical treatments has historically been a slow, arduous process, screening billions of molecular compounds across decade-long development cycles. The vast majority of therapeutic candidates do not even make it…