Tag: json

  • Simon Willison’s Weblog: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet

    Source URL: https://simonwillison.net/2025/Apr/14/gpt-4-1/ Source: Simon Willison’s Weblog Title: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet Feedly Summary: OpenAI introduced three new models this morning: GPT-4.1, GPT-4.1 mini and GPT-4.1 nano. These are API-only models right now, not available through the ChatGPT interface (though you can try them out…

  • Simon Willison’s Weblog: An LLM Query Understanding Service

    Source URL: https://simonwillison.net/2025/Apr/9/an-llm-query-understanding-service/#atom-everything Source: Simon Willison’s Weblog Title: An LLM Query Understanding Service Feedly Summary: An LLM Query Understanding Service Doug Turnbull recently wrote about how all search is structured now: Many times, even a small open source LLM will be able to turn a search query into reasonable structure at relatively low cost. In…

  • Simon Willison’s Weblog: Nomic Embed Code: A State-of-the-Art Code Retriever

    Source URL: https://simonwillison.net/2025/Mar/27/nomic-embed-code/ Source: Simon Willison’s Weblog Title: Nomic Embed Code: A State-of-the-Art Code Retriever Feedly Summary: Nomic Embed Code: A State-of-the-Art Code Retriever Nomic have released a new embedding model that specializes in code, based on their CoRNStack “large-scale high-quality training dataset specifically curated for code retrieval". The nomic-embed-code model is pretty large –…

  • Cloud Blog: Build richer gen AI experiences using model endpoint management

    Source URL: https://cloud.google.com/blog/products/databases/use-model-endpoint-management-on-alloydb/ Source: Cloud Blog Title: Build richer gen AI experiences using model endpoint management Feedly Summary: Model endpoint management is available on AlloyDB, AlloyDB Omni and Cloud SQL for PostgreSQL. Model endpoint management helps developers to build new experiences using SQL and provides a flexible interface to call gen AI models running anywhere…

  • Simon Willison’s Weblog: Function calling with Gemma

    Source URL: https://simonwillison.net/2025/Mar/26/function-calling-with-gemma/#atom-everything Source: Simon Willison’s Weblog Title: Function calling with Gemma Feedly Summary: Function calling with Gemma Google’s Gemma 3 model (the 27B variant is particularly capable, I’ve been trying it out via Ollama) supports function calling exclusively through prompt engineering. The official documentation describes two recommended prompts – both of them suggest that…

  • Hacker News: Gemma3 Function Calling

    Source URL: https://ai.google.dev/gemma/docs/capabilities/function-calling Source: Hacker News Title: Gemma3 Function Calling Feedly Summary: Comments AI Summary and Description: Yes Summary: The provided text discusses function calling with a generative AI model named Gemma, including its structure, usage, and recommendations for code execution. This information is critical for professionals working with AI systems, particularly in understanding how…