Tag: Kubernetes Engine

  • Cloud Blog: Save on GPUs: Smarter autoscaling for your GKE inferencing workloads

    Source URL: https://cloud.google.com/blog/products/containers-kubernetes/tuning-the-gke-hpa-to-run-inference-on-gpus/
    Source: Cloud Blog
    Title: Save on GPUs: Smarter autoscaling for your GKE inferencing workloads
    Feedly Summary: While LLM models deliver immense value for an increasing number of use cases, running LLM inference workloads can be costly. If you’re taking advantage of the latest open models and infrastructure, autoscaling can help you optimize…

  • Cloud Blog: Google is a Leader in Gartner Magic Quadrant for Strategic Cloud Platform Services

    Source URL: https://cloud.google.com/blog/products/infrastructure-modernization/google-is-a-leader-in-gartner-magic-quadrant-for-strategic-cloud-platform-services/
    Source: Cloud Blog
    Title: Google is a Leader in Gartner Magic Quadrant for Strategic Cloud Platform Services
    Feedly Summary: For the seventh consecutive year, Gartner® has named Google a Leader in the Gartner Magic Quadrant™ for Strategic Cloud Platform Services. This year marks a major milestone: Google has made a notable jump…

  • Cloud Blog: How to benchmark application performance from the user’s perspective

    Source URL: https://cloud.google.com/blog/products/containers-kubernetes/benchmarking-how-end-users-perceive-an-applications-performance/
    Source: Cloud Blog
    Title: How to benchmark application performance from the user’s perspective
    Feedly Summary: What kind of performance does your application have, and how do you know? More to the point, what kind of performance do your end users think your application has? In this era of rapid growth and unpredictable…

  • Cloud Blog: Founders share five takeaways from the Google Cloud Startup Summit

    Source URL: https://cloud.google.com/blog/topics/startups/founders-share-five-takeaways-from-the-google-cloud-startup-summit/
    Source: Cloud Blog
    Title: Founders share five takeaways from the Google Cloud Startup Summit
    Feedly Summary: We recently hosted our annual Google Cloud Startup Summit, and we were thrilled to showcase a wide range of AI startups leveraging Google Cloud, including Higgsfield AI, Click Therapeutics, Baseten, LiveX AI, Reve AI, and Vellum.…

  • Cloud Blog: GKE and the dreaded IP_SPACE_EXHAUSTED error: Understanding the culprit

    Source URL: https://cloud.google.com/blog/products/containers-kubernetes/avoiding-the-gke-ip_space_exhausted-error/
    Source: Cloud Blog
    Title: GKE and the dreaded IP_SPACE_EXHAUSTED error: Understanding the culprit
    Feedly Summary: If you leverage Google Kubernetes Engine (GKE) within your Google Cloud environment, you’ve likely encountered the confidence-shattering “IP_SPACE_EXHAUSTED” error. It’s a common scenario: you’re convinced your IP address planning is flawless, your subnet design is future-proof, and…

  • Cloud Blog: Choosing between self-hosted GKE and managed Vertex AI to host AI models

    Source URL: https://cloud.google.com/blog/products/application-development/choosing-a-self-hosted-or-managed-solution-for-ai-app-development/
    Source: Cloud Blog
    Title: Choosing between self-hosted GKE and managed Vertex AI to host AI models
    Feedly Summary: In today’s technology landscape, building or modernizing applications demands a clear understanding of your business goals and use cases. This insight is crucial for leveraging emerging tools effectively, especially generative AI foundation models such…