Tag: Kubernetes
-
Cloud Blog: Run OpenAI’s new gpt-oss model at scale with Google Kubernetes Engine
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/run-openais-new-gpt-oss-model-at-scale-with-gke/ Source: Cloud Blog Title: Run OpenAI’s new gpt-oss model at scale with Google Kubernetes Engine Feedly Summary: It’s exciting to see OpenAI contribute to the open ecosystem with the release of their new open weights model, gpt-oss. In keeping with our commitment to provide the best platform for open AI innovation, we’re…
-
Cloud Blog: How Yahoo Calendar broke free from hardware queues and DBA bottlenecks
Source URL: https://cloud.google.com/blog/products/infrastructure-modernization/how-yahoo-calendar-broke-free-from-hardware-queues-and-dba-bottlenecks/ Source: Cloud Blog Title: How Yahoo Calendar broke free from hardware queues and DBA bottlenecks Feedly Summary: Editor’s note: Yahoo Mail is in the midst of one of its largest infrastructure transformations to date: a multi-year effort to modernize hundreds of petabytes of services by moving to Google Cloud.The Yahoo Mail migration…
-
Cloud Blog: Supercharge your AI: GKE inference reference architecture, your blueprint for production-ready inference
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/supercharge-your-ai-gke-inference-reference-architecture-your-blueprint-for-production-ready-inference/ Source: Cloud Blog Title: Supercharge your AI: GKE inference reference architecture, your blueprint for production-ready inference Feedly Summary: The age of AI is here, and organizations everywhere are racing to deploy powerful models to drive innovation, enhance products, and create entirely new user experiences. But moving from a trained model in a…
-
Cloud Blog: Understanding Calendar mode for Dynamic Workload Scheduler: Reserve ML GPUs and TPUs
Source URL: https://cloud.google.com/blog/products/compute/dynamic-workload-scheduler-calendar-mode-reserves-gpus-and-tpus/ Source: Cloud Blog Title: Understanding Calendar mode for Dynamic Workload Scheduler: Reserve ML GPUs and TPUs Feedly Summary: Organizations need ML compute resources that can accommodate bursty peaks and periodic troughs. That means the consumption models for AI infrastructure need to evolve to be more cost-efficient, provide term flexibility, and support rapid…
-
Cloud Blog: Celebrating 10 years of GKE: Incredible customer journeys, amazing AI futures
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/10-years-of-gke-ebook/ Source: Cloud Blog Title: Celebrating 10 years of GKE: Incredible customer journeys, amazing AI futures Feedly Summary: The evolution of the cloud has been tremendous over the past decade. Every step of the way, Google Kubernetes Engine (GKE) has been there to meet new challenges. From giving DevOps more scalable foundations to…