Tag: cluster

  • Cloud Blog: Cybercrime: A Multifaceted National Security Threat

    Source URL: https://cloud.google.com/blog/topics/threat-intelligence/cybercrime-multifaceted-national-security-threat/ Source: Cloud Blog Title: Cybercrime: A Multifaceted National Security Threat Feedly Summary: Executive Summary Cybercrime makes up a majority of the malicious activity online and occupies the majority of defenders’ resources. In 2024, Mandiant Consulting responded to almost four times more intrusions conducted by financially motivated actors than state-backed intrusions. Despite this…

  • Cloud Blog: Balance of power: A full-stack approach to power and thermal fluctuations in ML infrastructure

    Source URL: https://cloud.google.com/blog/topics/systems/mitigating-power-and-thermal-fluctuations-in-ml-infrastructure/ Source: Cloud Blog Title: Balance of power: A full-stack approach to power and thermal fluctuations in ML infrastructure Feedly Summary: The recent explosion of machine learning (ML) applications has created unprecedented demand for power delivery in the data center infrastructure that underpins those applications. Unlike server clusters in the traditional data center,…

  • Cloud Blog: Networking support for AI workloads

    Source URL: https://cloud.google.com/blog/products/networking/cross-cloud-network-solutions-support-for-ai-workloads/ Source: Cloud Blog Title: Networking support for AI workloads Feedly Summary: At Google Cloud, we strive to make it easy to deploy AI models onto our infrastructure. In this blog we explore how the Cross-Cloud Network solution supports your AI workloads. Managed and Unmanaged AI options Google Cloud provides both managed (Vertex…

  • Cloud Blog: News you can use: What we announced in AI this month

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/what-google-cloud-announced-in-ai-this-month/ Source: Cloud Blog Title: News you can use: What we announced in AI this month Feedly Summary: 2025 is off to a racing start. From announcing strides in the new Gemini 2.0 model family to retailers accelerating with Cloud AI, we spent January investing in our partner ecosystem, open-source, and ways to…

  • Cloud Blog: How we improved GKE volume attachments for stateful applications by up to 80%

    Source URL: https://cloud.google.com/blog/products/containers-kubernetes/gkes-faster-cluster-upgrades-under-the-hood/ Source: Cloud Blog Title: How we improved GKE volume attachments for stateful applications by up to 80% Feedly Summary: If you run stateful workloads on Google Kubernetes Engine (GKE), you may have noticed that your Cluster upgrades execute much faster as of late. You’re not imagining things. We recently introduced an  enhancement…

  • Hacker News: Show HN: Klarity – Open-source tool to analyze uncertainty/entropy in LLM output

    Source URL: https://github.com/klara-research/klarity Source: Hacker News Title: Show HN: Klarity – Open-source tool to analyze uncertainty/entropy in LLM output Feedly Summary: Comments AI Summary and Description: Yes **Summary:** Klarity is a robust tool designed for analyzing uncertainty in generative model predictions. By leveraging both raw probability and semantic comprehension, it provides unique insights into model…

  • Cloud Blog: Blackwell is here — new A4 VMs powered by NVIDIA B200 now in preview

    Source URL: https://cloud.google.com/blog/products/compute/introducing-a4-vms-powered-by-nvidia-b200-gpu-aka-blackwell/ Source: Cloud Blog Title: Blackwell is here — new A4 VMs powered by NVIDIA B200 now in preview Feedly Summary: Modern AI workloads require powerful accelerators and high-speed interconnects to run sophisticated model architectures on an ever-growing diverse range of model sizes and modalities. In addition to large-scale training, these complex models…