Tag: mean
-
Cloud Blog: Enhancing backup vaults with support for Persistent Disk, Hyperdisk, and multi-regions
Source URL: https://cloud.google.com/blog/products/storage-data-transfer/backup-vaults-add-support-for-disk-backup-and-multi-region/
Source: Cloud Blog
Title: Enhancing backup vaults with support for Persistent Disk, Hyperdisk, and multi-regions
Feedly Summary: To help protect against evolving digital threats like ransomware and malicious deletions, last year, we introduced backup vault in the Google Cloud Backup and DR service, with support for Compute Engine VM backups. This provided…
-
Cloud Blog: GKE workload scheduling: Strategies for when resources get tight
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/gke-features-to-optimize-resource-allocation/
Source: Cloud Blog
Title: GKE workload scheduling: Strategies for when resources get tight
Feedly Summary: As a customer of Google Kubernetes Engine (GKE), you’ve selected a container runtime with a high degree of managed operations, encompassing everything from automatic upgrades to effortless node management. This inherent efficiency allows you to focus more…
-
Cloud Blog: Save early and often with multi-tier checkpointing to optimize large AI training jobs
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/using-multi-tier-checkpointing-for-large-ai-training-jobs/
Source: Cloud Blog
Title: Save early and often with multi-tier checkpointing to optimize large AI training jobs
Feedly Summary: As foundation model training infrastructure scales to tens of thousands of accelerators, efficient utilization of those high-value resources becomes paramount. In particular, as the cluster gets larger, hardware failures become more frequent (~…