Tag: workload optimization

Source URL: https://cloud.google.com/blog/products/containers-kubernetes/gke-autopilot-now-available-to-all-qualifying-clusters/ Source: Cloud Blog Title: Meet the new GKE: Extending Autopilot to all qualifying clusters Feedly Summary: Autopilot is an operational mode for Google Kubernetes Engine (GKE) that provides a fully managed environment and takes care of operational details, like provisioning compute capacity for your workloads. Autopilot allows you to spend more time…

Cloud Blog: Scaling high-performance inference cost-effectively

Sep 10, 2025

—

by

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/gke-inference-gateway-and-quickstart-are-ga/ Source: Cloud Blog Title: Scaling high-performance inference cost-effectively Feedly Summary: At Google Cloud Next 2025, we announced new inference capabilities with GKE Inference Gateway, including support for vLLM on TPUs, Ironwood TPUs, and Anywhere Cache. Our inference solution is based on AI Hypercomputer, a system built on our experience running models like…

AWS News Blog: New Amazon EC2 Graviton4-based instances with NVMe SSD storage

Apr 21, 2025

—

by

Source URL: https://aws.amazon.com/blogs/aws/new-amazon-ec2-graviton4-based-instances-with-nvme-ssd-storage/ Source: AWS News Blog Title: New Amazon EC2 Graviton4-based instances with NVMe SSD storage Feedly Summary: AWS introduces new EC2 instance families (C8gd, M8gd, R8gd) powered by Graviton4 processors with NVMe SSD storage, offering up to 30% better performance, 3x more vCPUs and memory, and up to 11.4TB local storage compared to…

Cloud Blog: Spring cleaning with FinOps Hub 2.0

Apr 16, 2025

—

by

Source URL: https://cloud.google.com/blog/topics/cost-management/spring-cleaning-with-finops-hub/ Source: Cloud Blog Title: Spring cleaning with FinOps Hub 2.0 Feedly Summary: Spring is a great reminder to spring clean – an annual tradition that should extend not only to your household, but also to your virtual cloud infrastructure. Why not start with Google Cloud’s FinOps Hub? As Google Cloud customers have…

Cloud Blog: Five tips and tricks to improve your AI workloads

Mar 18, 2025

—

by

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/reduce-cost-and-improve-your-ai-workloads/ Source: Cloud Blog Title: Five tips and tricks to improve your AI workloads Feedly Summary: Recently, we announced Gemini Code Assist for individuals, a free version of our AI coding assistant. Technology that was previously available only to the biggest enterprises is now within reach for startups and individual developers. The same…

Cloud Blog: Google is a Leader, positioned furthest in vision in the 2024 Gartner Magic Quadrant for Cloud Database Management Systems

Dec 23, 2024

—

by

Source URL: https://cloud.google.com/blog/products/databases/2024-gartner-magic-quadrant-for-cloud-database-management-systems/ Source: Cloud Blog Title: Google is a Leader, positioned furthest in vision in the 2024 Gartner Magic Quadrant for Cloud Database Management Systems Feedly Summary: Gartner has recognized Google as a Leader in the 2024 Gartner® Magic Quadrant™ for Cloud Database Management Systems for the fifth year in a row. Google is…

Hacker News: AI Flame Graphs

Oct 30, 2024

—

by

Source URL: https://www.brendangregg.com/blog//2024-10-29/ai-flame-graphs.html Source: Hacker News Title: AI Flame Graphs Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses Intel’s development of a tool called AI Flame Graphs, designed to optimize AI workloads by profiling resource utilization on AI accelerators and GPUs. By visualizing the software stack and identifying inefficiencies, this tool…

Cloud Blog: AI Hypercomputer software updates: Faster training and inference, a new resource hub, and more

Oct 25, 2024

—

by

Source URL: https://cloud.google.com/blog/products/compute/updates-to-ai-hypercomputer-software-stack/ Source: Cloud Blog Title: AI Hypercomputer software updates: Faster training and inference, a new resource hub, and more Feedly Summary: The potential of AI has never been greater, and infrastructure plays a foundational role in driving it forward. AI Hypercomputer is our supercomputing architecture based on performance-optimized hardware, open software, and flexible…

Cloud Blog: Google is a Leader in Gartner Magic Quadrant for Strategic Cloud Platform Services

Oct 23, 2024

—

by