Tag: json
-
Simon Willison’s Weblog: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet
Source URL: https://simonwillison.net/2025/Apr/14/gpt-4-1/ Source: Simon Willison’s Weblog Title: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet Feedly Summary: OpenAI introduced three new models this morning: GPT-4.1, GPT-4.1 mini and GPT-4.1 nano. These are API-only models right now, not available through the ChatGPT interface (though you can try them out…
-
Simon Willison’s Weblog: An LLM Query Understanding Service
Source URL: https://simonwillison.net/2025/Apr/9/an-llm-query-understanding-service/#atom-everything Source: Simon Willison’s Weblog Title: An LLM Query Understanding Service Feedly Summary: Doug Turnbull recently wrote about how all search is structured now: Many times, even a small open source LLM will be able to turn a search query into reasonable structure at relatively low cost. In…
-
Simon Willison’s Weblog: Model Context Protocol has prompt injection security problems
Source URL: https://simonwillison.net/2025/Apr/9/mcp-prompt-injection/#atom-everything Source: Simon Willison’s Weblog Title: Model Context Protocol has prompt injection security problems Feedly Summary: As more people start hacking around with implementations of MCP (the Model Context Protocol, a new standard for making tools available to LLM-powered systems) the security implications of tools built on that protocol are starting to come…
-
Cloud Blog: Google Cloud databases supercharge the AI developer experience
Source URL: https://cloud.google.com/blog/products/databases/whats-new-for-google-cloud-databases-at-next25/ Source: Cloud Blog Title: Google Cloud databases supercharge the AI developer experience Feedly Summary: Generative AI continues to capture our imagination and promises to transform every industry. Its transformative potential hinges on integrating powerful models like Gemini, with the most contextually-relevant enterprise data. Google Cloud is leading this transformation, not only by…
-
Simon Willison’s Weblog: Long context support in LLM 0.24 using fragments and template plugins
Source URL: https://simonwillison.net/2025/Apr/7/long-context-llm/#atom-everything Source: Simon Willison’s Weblog Title: Long context support in LLM 0.24 using fragments and template plugins Feedly Summary: LLM 0.24 is now available with new features to help take advantage of the increasingly long input context supported by modern LLMs. (LLM is my command-line tool and Python library for interacting with LLMs,…
-
Cloud Blog: Accelerate analytics with AI-assisted data preparation in BigQuery, now GA
Source URL: https://cloud.google.com/blog/products/data-analytics/ai-assisted-bigquery-data-preparation-now-ga/ Source: Cloud Blog Title: Accelerate analytics with AI-assisted data preparation in BigQuery, now GA Feedly Summary: According to Gartner®, “Gartner clients now report that 90% or more of their time is spent preparing data (as high as 94% in complex industries) for advanced analytics, data science and data engineering.”¹ Last year, we…
-
Simon Willison’s Weblog: Nomic Embed Code: A State-of-the-Art Code Retriever
Source URL: https://simonwillison.net/2025/Mar/27/nomic-embed-code/ Source: Simon Willison’s Weblog Title: Nomic Embed Code: A State-of-the-Art Code Retriever Feedly Summary: Nomic have released a new embedding model that specializes in code, based on their CoRNStack “large-scale high-quality training dataset specifically curated for code retrieval”. The nomic-embed-code model is pretty large –…
-
Cloud Blog: Build richer gen AI experiences using model endpoint management
Source URL: https://cloud.google.com/blog/products/databases/use-model-endpoint-management-on-alloydb/ Source: Cloud Blog Title: Build richer gen AI experiences using model endpoint management Feedly Summary: Model endpoint management is available on AlloyDB, AlloyDB Omni and Cloud SQL for PostgreSQL. Model endpoint management helps developers to build new experiences using SQL and provides a flexible interface to call gen AI models running anywhere…
-
Simon Willison’s Weblog: Function calling with Gemma
Source URL: https://simonwillison.net/2025/Mar/26/function-calling-with-gemma/#atom-everything Source: Simon Willison’s Weblog Title: Function calling with Gemma Feedly Summary: Google’s Gemma 3 model (the 27B variant is particularly capable, I’ve been trying it out via Ollama) supports function calling exclusively through prompt engineering. The official documentation describes two recommended prompts – both of them suggest that…
-
Hacker News: Gemma3 Function Calling
Source URL: https://ai.google.dev/gemma/docs/capabilities/function-calling Source: Hacker News Title: Gemma3 Function Calling Feedly Summary: The provided text discusses function calling with a generative AI model named Gemma, including its structure, usage, and recommendations for code execution. This information is critical for professionals working with AI systems, particularly in understanding how…