Tag: generative

  • Simon Willison’s Weblog: Quoting Drew Breunig

    Source URL: https://simonwillison.net/2025/Apr/10/drew-breunig/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Quoting Drew Breunig
    Feedly Summary: The first generation of AI-powered products (often called “AI Wrapper” apps, because they “just” are wrapped around an LLM API) were quickly brought to market by small teams of engineers, picking off the low-hanging problems. But today, I’m seeing teams of domain…

  • Simon Willison’s Weblog: LLM pricing calculator (updated)

    Source URL: https://simonwillison.net/2025/Apr/10/llm-pricing-calculator/#atom-everything
    Source: Simon Willison’s Weblog
    Title: LLM pricing calculator (updated)
    Feedly Summary: I updated my LLM pricing calculator this morning (Claude transcript) to show the prices of various hosted models in a sorted table, defaulting to lowest price first. Amazon Nova and Google Gemini continue to dominate the lower…

  • Cloud Blog: New GKE inference capabilities reduce costs, tail latency and increase throughput

    Source URL: https://cloud.google.com/blog/products/containers-kubernetes/understanding-new-gke-inference-capabilities/
    Source: Cloud Blog
    Title: New GKE inference capabilities reduce costs, tail latency and increase throughput
    Feedly Summary: When it comes to AI, inference is where today’s generative AI models can solve real-world business problems. Google Kubernetes Engine (GKE) is seeing increasing adoption of gen AI inference. For example, customers like HubX run…

  • Simon Willison’s Weblog: llm-fragments-go

    Source URL: https://simonwillison.net/2025/Apr/10/llm-fragments-go/#atom-everything
    Source: Simon Willison’s Weblog
    Title: llm-fragments-go
    Feedly Summary: Filippo Valsorda released the first plugin by someone other than me that uses LLM’s new register_fragment_loaders() plugin hook I announced the other day. Install with llm install llm-fragments-go and then: You can feed the docs of a Go package into LLM using the…

  • Cloud Blog: Day 1 at Google Cloud Next 25 recap

    Source URL: https://cloud.google.com/blog/topics/google-cloud-next/next25-day-1-recap/
    Source: Cloud Blog
    Title: Day 1 at Google Cloud Next 25 recap
    Feedly Summary: Hello from Google Cloud Next 25 in Las Vegas! This year, it’s all about how AI can reimagine work and improve our lives, even bringing Hollywood classics like The Wizard of Oz to life on one of…

  • Simon Willison’s Weblog: An LLM Query Understanding Service

    Source URL: https://simonwillison.net/2025/Apr/9/an-llm-query-understanding-service/#atom-everything
    Source: Simon Willison’s Weblog
    Title: An LLM Query Understanding Service
    Feedly Summary: Doug Turnbull recently wrote about how all search is structured now: Many times, even a small open source LLM will be able to turn a search query into reasonable structure at relatively low cost. In…

  • Cloud Blog: Global startups are building the future of AI on Google Cloud

    Source URL: https://cloud.google.com/blog/topics/startups/why-global-startups-are-gathering-at-google-cloud-next25/
    Source: Cloud Blog
    Title: Global startups are building the future of AI on Google Cloud
    Feedly Summary: The most exciting startups in the world are in Las Vegas this week, as Google Cloud Next kicks off with a major focus on how AI and cloud are powering the next great wave of…