Tag: Uber

  • Cloud Blog: New GKE inference capabilities reduce costs, tail latency and increase throughput

    Source URL: https://cloud.google.com/blog/products/containers-kubernetes/understanding-new-gke-inference-capabilities/ Source: Cloud Blog Title: New GKE inference capabilities reduce costs, tail latency and increase throughput Feedly Summary: When it comes to AI, inference is where today’s generative AI models can solve real-world business problems. Google Kubernetes Engine (GKE) is seeing increasing adoption of gen AI inference. For example, customers like HubX run…

  • Cloud Blog: Google Cloud and Oracle accelerate enterprise modernization with new offerings, regions, and capabilities

    Source URL: https://cloud.google.com/blog/products/databases/google-cloud-and-oracle-accelerate-enterprise-modernization/ Source: Cloud Blog Title: Google Cloud and Oracle accelerate enterprise modernization with new offerings, regions, and capabilities Feedly Summary: Supporting customers where they want to be is a core value at Google Cloud, and a big part of the reason that we have partnered with Oracle — so that you can innovate…

  • The Cloudflare Blog: Making Super Slurper 5x faster with Workers, Durable Objects, and Queues

    Source URL: https://blog.cloudflare.com/making-super-slurper-five-times-faster/ Source: The Cloudflare Blog Title: Making Super Slurper 5x faster with Workers, Durable Objects, and Queues Feedly Summary: We re-architected Super Slurper from the ground up using our Developer Platform — leveraging Cloudflare Workers, Durable Objects, and Queues — and improved transfer speeds by up to 5x. AI Summary and Description: Yes…

  • Cloud Blog: Introducing Ironwood TPUs and new innovations in AI Hypercomputer

    Source URL: https://cloud.google.com/blog/products/compute/whats-new-with-ai-hypercomputer/ Source: Cloud Blog Title: Introducing Ironwood TPUs and new innovations in AI Hypercomputer Feedly Summary: Today’s innovation isn’t born in a lab or at a drafting board; it’s built on the bedrock of AI infrastructure. AI workloads have new and unique demands — addressing these requires a finely crafted combination of hardware…

  • Simon Willison’s Weblog: An LLM Query Understanding Service

    Source URL: https://simonwillison.net/2025/Apr/9/an-llm-query-understanding-service/#atom-everything Source: Simon Willison’s Weblog Title: An LLM Query Understanding Service Feedly Summary: An LLM Query Understanding Service Doug Turnbull recently wrote about how all search is structured now: Many times, even a small open source LLM will be able to turn a search query into reasonable structure at relatively low cost. In…

  • Cloud Blog: Driving enterprise transformation with new compute innovations and offerings

    Source URL: https://cloud.google.com/blog/products/compute/delivering-new-compute-innovations-and-offerings/ Source: Cloud Blog Title: Driving enterprise transformation with new compute innovations and offerings Feedly Summary: In the last 12 months, we’ve made incredible enhancements to our Compute Engine platform. This is driven most notably by new fourth-generation compute instances and Hyperdisk block storage as well as major customer experience enhancements. Across all…

  • Cloud Blog: Driving secure innovation with AI and Google Unified Security

    Source URL: https://cloud.google.com/blog/products/identity-security/driving-secure-innovation-with-ai-google-unified-security-next25/ Source: Cloud Blog Title: Driving secure innovation with AI and Google Unified Security Feedly Summary: Today at Google Cloud Next, we are announcing Google Unified Security, new security agents, and innovations across our security portfolio designed to deliver stronger security outcomes and enable every organization to make Google a part of their…

  • Cloud Blog: What’s new with Google Cloud networking

    Source URL: https://cloud.google.com/blog/products/networking/networking-innovations-at-google-cloud-next25/ Source: Cloud Blog Title: What’s new with Google Cloud networking Feedly Summary: The AI era is here, fundamentally reshaping industries and demanding unprecedented network capabilities for training, inference and serving AI models. To power this transformation, organizations need global networking solutions that can handle massive capacity, seamless connectivity, and provide robust security. …