Tag: llms

  • Simon Willison’s Weblog: llama-3.2-webgpu

    Source URL: https://simonwillison.net/2024/Sep/30/llama-32-webgpu/#atom-everything Source: Simon Willison’s Weblog Title: llama-3.2-webgpu Feedly Summary: llama-3.2-webgpu Llama 3.2 1B is a really interesting models, given its 128,000 token input and its tiny size (barely more than a GB). This page loads a 1.24GB q4f16 ONNX build of the Llama-3.2-1B-Instruct model and runs it with a React-powered chat interface directly…

  • Simon Willison’s Weblog: Updated production-ready Gemini models

    Source URL: https://simonwillison.net/2024/Sep/24/gemini-models/#atom-everything Source: Simon Willison’s Weblog Title: Updated production-ready Gemini models Feedly Summary: Updated production-ready Gemini models Two new models from Google Gemini today: gemini-1.5-pro-002 and gemini-1.5-flash-002. Their -latest aliases will update to these new models in “the next few days", and new -001 suffixes can be used to stick with the older models.…

  • Simon Willison’s Weblog: How streaming LLM APIs work

    Source URL: https://simonwillison.net/2024/Sep/22/how-streaming-llm-apis-work/#atom-everything Source: Simon Willison’s Weblog Title: How streaming LLM APIs work Feedly Summary: How streaming LLM APIs work New TIL. I used curl to explore the streaming APIs provided by OpenAI, Anthropic and Google Gemini and wrote up detailed notes on what I learned. Tags: apis, http, llms, json AI Summary and Description:…

  • Simon Willison’s Weblog: Introducing Contextual Retrieval

    Source URL: https://simonwillison.net/2024/Sep/20/introducing-contextual-retrieval/#atom-everything Source: Simon Willison’s Weblog Title: Introducing Contextual Retrieval Feedly Summary: Introducing Contextual Retrieval Here’s an interesting new embedding/RAG technique, described by Anthropic but it should work for any embedding model against any other LLM. One of the big challenges in implementing semantic search against vector embeddings – often used as part of…

  • Simon Willison’s Weblog: Quoting Terrence Tao

    Source URL: https://simonwillison.net/2024/Sep/15/terrence-tao/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Terrence Tao Feedly Summary: [… OpenAI’s o1] could work its way to a correct (and well-written) solution if provided a lot of hints and prodding, but did not generate the key conceptual ideas on its own, and did make some non-trivial mistakes. The experience seemed roughly…

  • Simon Willison’s Weblog: Notes on OpenAI’s new o1 chain-of-thought models

    Source URL: https://simonwillison.net/2024/Sep/12/openai-o1/ Source: Simon Willison’s Weblog Title: Notes on OpenAI’s new o1 chain-of-thought models Feedly Summary: OpenAI released two major new preview models today: o1-preview and o1-mini (that mini one is also a preview, despite the name) – previously rumored as having the codename “strawberry". There’s a lot to understand about these models –…

  • Simon Willison’s Weblog: Pixtral 12B

    Source URL: https://simonwillison.net/2024/Sep/11/pixtral/#atom-everything Source: Simon Willison’s Weblog Title: Pixtral 12B Feedly Summary: Pixtral 12B Pixtral 12B Mistral finally have a multi-model (image + text) vision LLM! I linked to their tweet, but there’s not much to see there – in now classic Mistral style they released the new model with an otherwise unlabeled link to…

  • Simon Willison’s Weblog: Calling LLMs from client-side JavaScript, converting PDFs to HTML + weeknotes

    Source URL: https://simonwillison.net/2024/Sep/6/weeknotes/ Source: Simon Willison’s Weblog Title: Calling LLMs from client-side JavaScript, converting PDFs to HTML + weeknotes Feedly Summary: I’ve been having a bunch of fun taking advantage of CORS-enabled LLM APIs to build client-side JavaScript applications that access LLMs directly. I also span up a new Datasette plugin for advanced permission management.…

  • Simon Willison’s Weblog: Anthropic’s Prompt Engineering Interactive Tutorial

    Source URL: https://simonwillison.net/2024/Aug/30/anthropic-prompt-engineering-interactive-tutorial/#atom-everything Source: Simon Willison’s Weblog Title: Anthropic’s Prompt Engineering Interactive Tutorial Feedly Summary: Anthropic’s Prompt Engineering Interactive Tutorial Anthropic continue their trend of offering the best documentation of any of the leading LLM vendors. This tutorial is delivered as a set of Jupyter notebooks – I used it as an excuse to try…

  • Simon Willison’s Weblog: How Anthropic built Artifacts

    Source URL: https://simonwillison.net/2024/Aug/28/how-anthropic-built-artifacts/#atom-everything Source: Simon Willison’s Weblog Title: How Anthropic built Artifacts Feedly Summary: How Anthropic built Artifacts Gergely Orosz interviews five members of Anthropic about how they built Artifacts on top of Claude 3.5 Sonnet with a small team in just three months. The initial prototype used Streamlit, and the biggest challenge was building…