Tag: high-throughput
-
Cloud Blog: How Baseten achieves 225% better cost-performance for AI inference (and you can too)
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/how-baseten-achieves-better-cost-performance-for-ai-inference/ Source: Cloud Blog Title: How Baseten achieves 225% better cost-performance for AI inference (and you can too) Feedly Summary: Baseten is one of a growing number of AI infrastructure providers, helping other startups run their models and experiments at speed and scale. Given the importance of those two factors to its customers,…
-
Cloud Blog: Scalable AI starts with storage: Guide to model artifact strategies
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/scalable-ai-starts-with-storage-guide-to-model-artifact-strategies/ Source: Cloud Blog Title: Scalable AI starts with storage: Guide to model artifact strategies Feedly Summary: Managing large model artifacts is a common bottleneck in MLOps. Baking models into container images leads to slow, monolithic deployments, and downloading them at startup introduces significant delays. This guide explores a better way: decoupling your…
-
Cloud Blog: Now GA: C4 VMs with Local SSD, bare metal, and larger shapes, on Intel Xeon 6
Source URL: https://cloud.google.com/blog/products/compute/c4-vms-based-on-intel-6th-gen-xeon-granite-rapids-now-ga/ Source: Cloud Blog Title: Now GA: C4 VMs with Local SSD, bare metal, and larger shapes, on Intel Xeon 6 Feedly Summary: We’re thrilled to announce a significant expansion of our C4 virtual machine series, with the general availability of 28 powerful new shapes. This expansion introduces C4 shapes with Google’s next-gen…
-
Cloud Blog: Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/take-an-open-model-from-discovery-to-endpoint-on-vertex-ai/ Source: Cloud Blog Title: Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI Feedly Summary: Developers building with gen AI are increasingly drawn to open models for their power and flexibility. But customizing and deploying them can be a huge challenge. You’re often left wrestling…
-
Cloud Blog: The Dataproc advantage: Advanced Spark features that will transform your analytics and AI
Source URL: https://cloud.google.com/blog/products/data-analytics/why-use-dataproc-for-your-apache-spark-environment/ Source: Cloud Blog Title: The Dataproc advantage: Advanced Spark features that will transform your analytics and AI Feedly Summary: With its exceptional price-performance, Google Cloud’s Dataproc has evolved from a simple, managed open-source software (OSS) service to a powerhouse in Apache Spark and open lakehouses, driving the analytics and AI workloads of…
-
Cloud Blog: Accelerate your AI workloads with the Google Cloud Managed Lustre
Source URL: https://cloud.google.com/blog/products/storage-data-transfer/google-cloud-managed-lustre-for-ai-hpc/ Source: Cloud Blog Title: Accelerate your AI workloads with the Google Cloud Managed Lustre Feedly Summary: Today, we’re making it even easier to achieve breakthrough performance for your AI/ML workloads: Google Cloud Managed Lustre is now GA, and available in four distinct performance tiers that deliver throughput ranging from 125 MB/s, 250…
-
Cloud Blog: Expanding Z3 family with 9 new VMs and a bare metal instance for storage and I/O intensive workloads
Source URL: https://cloud.google.com/blog/products/compute/expanded-z3-vm-portfolio-for-io-intensive-workloads/ Source: Cloud Blog Title: Expanding Z3 family with 9 new VMs and a bare metal instance for storage and I/O intensive workloads Feedly Summary: Today, we are thrilled to announce the expansion of the Z3 Storage Optimized VM family with the general availability of nine new Z3 virtual machines that offer local…
-
AWS News Blog: AWS Weekly Roundup: re:Inforce re:Cap, Valkey GLIDE 2.0, Avro and Protobuf or MCP Servers on Lambda, and more (June 23, 2025)
Source URL: https://aws.amazon.com/blogs/aws/aws-weekly-roundup-reinforce-recap-valkey-glide-2-0-avro-and-protobuf-or-mcp-servers-on-lambda-and-more-june-23-2025/ Source: AWS News Blog Title: AWS Weekly Roundup: re:Inforce re:Cap, Valkey GLIDE 2.0, Avro and Protobuf or MCP Servers on Lambda, and more (June 23, 2025) Feedly Summary: Last week’s hallmark event was the security-focused AWS re:Inforce conference. Now a tradition, the blog team wrote a re:Cap post to summarize the announcements…
-
Cloud Blog: Gemini momentum continues with launch of 2.5 Flash-Lite and general availability of 2.5 Flash and Pro on Vertex AI
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/gemini-2-5-flash-lite-flash-pro-ga-vertex-ai/ Source: Cloud Blog Title: Gemini momentum continues with launch of 2.5 Flash-Lite and general availability of 2.5 Flash and Pro on Vertex AI Feedly Summary: The momentum of the Gemini 2.5 era continues to build. Following our recent announcements, we’re empowering enterprise builders and developers with even greater access to the intelligence,…
-
Cloud Blog: New G4 VMs with NVIDIA RTX PRO 6000 Blackwell power AI, graphics, gaming and beyond
Source URL: https://cloud.google.com/blog/products/compute/introducing-g4-vm-with-nvidia-rtx-pro-6000/ Source: Cloud Blog Title: New G4 VMs with NVIDIA RTX PRO 6000 Blackwell power AI, graphics, gaming and beyond Feedly Summary: Today, we’re excited to announce the preview of our new G4 VMs based on NVIDIA RTX PRO 6000 Blackwell Server edition — the first cloud provider to do so. This follows…