GKE – Page 3 – Experimental News Clipping Site

Cloud Blog: 65,000 nodes and counting: Google Kubernetes Engine is ready for trillion-parameter AI models

Nov 13, 2024

—

by

Source URL: https://cloud.google.com/blog/products/containers-kubernetes/gke-65k-nodes-and-counting/ Source: Cloud Blog Title: 65,000 nodes and counting: Google Kubernetes Engine is ready for trillion-parameter AI models Feedly Summary: As generative AI evolves, we’re beginning to see the transformative potential it is having across industries and our lives. And as large language models (LLMs) increase in size — current models are reaching…

Cloud Blog: A new flexible DNS-based approach for accessing the GKE control plane

Nov 11, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/containers-kubernetes/new-dns-based-endpoint-for-the-gke-control-plane/ Source: Cloud Blog Title: A new flexible DNS-based approach for accessing the GKE control plane Feedly Summary: If you run Google Kubernetes Engine (GKE), you know it’s important to secure access to the cluster control plane that handles Kubernetes API requests, so you can prevent unauthorized access while still being able to…

Cloud Blog: How Verve achieves 37% performance gains with C4 machines and new GKE features

Nov 11, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/infrastructure/how-verve-achieves-37-percent-performance-gains-with-new-gke-features-and-c4-deliver/ Source: Cloud Blog Title: How Verve achieves 37% performance gains with C4 machines and new GKE features Feedly Summary: Earlier this year, Google Cloud launched the highly anticipated C4 machine series, built on the latest Intel Xeon Scalable processors (5th Gen Emerald Rapids), setting a new industry-leading performance standard for both Google…

Cloud Blog: Etsy’s Service Platform on Cloud Run cuts deployment time from days to under an hour

Nov 8, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/infrastructure/etsys-service-platform-on-cloud-run-cuts-deployment-time-from-days-to-minutes/ Source: Cloud Blog Title: Etsy’s Service Platform on Cloud Run cuts deployment time from days to under an hour Feedly Summary: Introduction Etsy, a leading ecommerce marketplace for handmade, vintage, and unique items has a passion for delivering innovative and seamless experiences for customers. Like many fast growing companies, Etsy needed to…

Cloud Blog: How to deploy and serve multi-host gen AI large open models over GKE

Nov 8, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/deploy-and-serve-open-models-over-google-kubernetes-engine/ Source: Cloud Blog Title: How to deploy and serve multi-host gen AI large open models over GKE Feedly Summary: Context As generative AI experiences explosive growth fueled by advancements in LLMs (Large Language Models), access to open models is more critical than ever for developers. Open models are publicly available pre-trained foundational…

Cloud Blog: Elevate network security and migrate your legacy VPC firewall rules to Cloud NGFW

Nov 7, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/networking/from-vpc-firewall-rules-to-cloud-ngfw-network-firewall-policies/ Source: Cloud Blog Title: Elevate network security and migrate your legacy VPC firewall rules to Cloud NGFW Feedly Summary: Over the past 18 months, Google Cloud has significantly enhanced its network security offerings, culminating in the launch of Cloud Next-Generation Firewall (NGFW), formerly Cloud Firewall Plus. This evolution reflects the platform’s expanded…

Cloud Blog: C4A VMs now GA: Our first custom Arm-based Axion CPU

Oct 30, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/compute/try-c4a-the-first-google-axion-processor/ Source: Cloud Blog Title: C4A VMs now GA: Our first custom Arm-based Axion CPU Feedly Summary: At Google Next ‘24, we announced Google Axion Processors, our first custom Arm®-based CPUs designed for the data center. Today, we’re thrilled to announce the general availability of C4A virtual machines, the first Axion-based VM series,…

Cloud Blog: Powerful infrastructure innovations for your AI-first future

Oct 30, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/compute/trillium-sixth-generation-tpu-is-in-preview/ Source: Cloud Blog Title: Powerful infrastructure innovations for your AI-first future Feedly Summary: The rise of generative AI has ushered in an era of unprecedented innovation, demanding increasingly complex and more powerful AI models. These advanced models necessitate high-performance infrastructure capable of efficiently scaling AI training, tuning, and inferencing workloads while optimizing…

Hacker News: Leveraging Class E address space to mitigate IPv4 exhaustion issues in GKE

Oct 23, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/containers-kubernetes/how-class-e-addresses-solve-for-ip-address-exhaustion-in-gke/ Source: Hacker News Title: Leveraging Class E address space to mitigate IPv4 exhaustion issues in GKE Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the challenges of IP address exhaustion in Google Kubernetes Engine (GKE), highlighting the potential use of Class E IPv4 addresses as a solution. While…

Cloud Blog: Save on GPUs: Smarter autoscaling for your GKE inferencing workloads

Oct 23, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/containers-kubernetes/tuning-the-gke-hpa-to-run-inference-on-gpus/ Source: Cloud Blog Title: Save on GPUs: Smarter autoscaling for your GKE inferencing workloads Feedly Summary: While LLM models deliver immense value for an increasing number of use cases, running LLM inference workloads can be costly. If you’re taking advantage of the latest open models and infrastructure, autoscaling can help you optimize…

Tag: GKE