Tag: high-bandwidth memory
-
The Register: Nvidia’s context-optimized Rubin CPX GPUs were inevitable
Source URL: https://www.theregister.com/2025/09/10/nvidia_rubin_cpx/ Source: The Register Title: Nvidia’s context-optimized Rubin CPX GPUs were inevitable Feedly Summary: Why strap pricey, power-hungry HBM to a job that doesn’t benefit from the bandwidth? Analysis Nvidia on Tuesday unveiled the Rubin CPX, a GPU designed specifically to accelerate extremely long-context AI workflows like those seen in code assistants such…
-
Cloud Blog: Google is a Leader in the Gartner® Magic Quadrant for Strategic Cloud Platform Services
Source URL: https://cloud.google.com/blog/products/compute/google-is-a-leader-in-gartner-magic-quadrant-for-scps/ Source: Cloud Blog Title: Google is a Leader in the Gartner® Magic Quadrant for Strategic Cloud Platform Services Feedly Summary: For the eighth consecutive year, Gartner® has named Google a Leader in the Gartner Magic Quadrant™ for Strategic Cloud Platform Services, and this year Google is also now ranked the highest for…
-
Cloud Blog: Announcing a new monitoring library to optimize TPU performance
Source URL: https://cloud.google.com/blog/products/compute/new-monitoring-library-to-optimize-google-cloud-tpu-resources/ Source: Cloud Blog Title: Announcing a new monitoring library to optimize TPU performance Feedly Summary: For more than a decade, TPUs have powered Google’s most demanding AI training and serving workloads. And there is strong demand from customers for Cloud TPUs as well. When running advanced AI workloads, you need to be…
-
The Register: The network is indeed trying to become the computer
Source URL: https://www.theregister.com/2025/06/27/analysis_network_computing/ Source: The Register Title: The network is indeed trying to become the computer Feedly Summary: Masked networking costs are coming to AI systems Analysis Moore’s Law has run out of gas and AI workloads need massive amounts of parallel compute and high bandwidth memory right next to it – both of which…
-
Cloud Blog: Speed up checkpoint loading time at scale using Orbax on JAX
Source URL: https://cloud.google.com/blog/products/compute/unlock-faster-workload-start-time-using-orbax-on-jax/ Source: Cloud Blog Title: Speed up checkpoint loading time at scale using Orbax on JAX Feedly Summary: Imagine training a new AI / ML model like Gemma 3 or Llama 3.3 across hundreds of powerful accelerators like TPUs or GPUs to achieve a scientific breakthrough. You might have a team of powerful…