Tag: node pools
-
Cloud Blog: Announcing smaller machine types for A3 High VMs
Source URL: https://cloud.google.com/blog/products/compute/announcing-smaller-machine-types-for-a3-high-vms/ Source: Cloud Blog Title: Announcing smaller machine types for A3 High VMs Feedly Summary: Today, an increasing number of organizations are using GPUs to run inference1 on their AI/ML models. Since the number of GPUs needed to serve a single inference workload varies, organizations need more granularity in the number of GPUs…
-
Cloud Blog: Data loading best practices for AI/ML inference on GKE
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/improve-data-loading-times-for-ml-inference-apps-on-gke/ Source: Cloud Blog Title: Data loading best practices for AI/ML inference on GKE Feedly Summary: As AI models increase in sophistication, there’s increasingly large model data needed to serve them. Loading the models and weights along with necessary frameworks to serve them for inference can add seconds or even minutes of scaling…
-
Cloud Blog: GKE and the dreaded IP_SPACE_EXHAUSTED error: Understanding the culprit
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/avoiding-the-gke-ip_space_exhausted-error/ Source: Cloud Blog Title: GKE and the dreaded IP_SPACE_EXHAUSTED error: Understanding the culprit Feedly Summary: If you leverage Google Kubernetes Engine (GKE) within your Google Cloud environment, you’ve likely encountered the confidence-shattering “IP_SPACE_EXHAUSTED” error. It’s a common scenario: you’re convinced your IP address planning is flawless, your subnet design is future-proof, and…