Tag: experimentation

  • Simon Willison’s Weblog: Mistral Small 3.1

    Source URL: https://simonwillison.net/2025/Mar/17/mistral-small-31/#atom-everything Source: Simon Willison’s Weblog Title: Mistral Small 3.1 Feedly Summary: Mistral Small 3 came out in January and was a notable, genuinely excellent local model that used an Apache 2.0 license. Mistral Small 3.1 offers a significant improvement: it’s multi-modal (images) and has an increased 128,000 token context length,…

  • Simon Willison’s Weblog: mlx-community/OLMo-2-0325-32B-Instruct-4bit

    Source URL: https://simonwillison.net/2025/Mar/16/olmo2/#atom-everything Source: Simon Willison’s Weblog Title: mlx-community/OLMo-2-0325-32B-Instruct-4bit Feedly Summary: OLMo 2 32B claims to be “the first fully-open model (all data, code, weights, and details are freely available) to outperform GPT3.5-Turbo and GPT-4o mini”. Thanks to the MLX project here’s a recipe that worked for me to run it on my Mac,…

  • Simon Willison’s Weblog: Notes on Google’s Gemma 3

    Source URL: https://simonwillison.net/2025/Mar/12/notes-on-googles-gemma-3/ Source: Simon Willison’s Weblog Title: Notes on Google’s Gemma 3 Feedly Summary: Google’s Gemma team released an impressive new model today (under their not-open-source Gemma license). Gemma 3 comes in four sizes – 1B, 4B, 12B, and 27B – and while 1B is text-only the larger three models are all multi-modal for…

  • Hacker News: Koko (YC W22) Is Hiring a CTO / Lead Engineer

    Source URL: https://www.ycombinator.com/companies/koko-2/jobs/oPgy08B-lead-engineer-cto Source: Hacker News Title: Koko (YC W22) Is Hiring a CTO / Lead Engineer Summary: The text outlines the mission and operational framework of Koko, a mental health tech nonprofit employing AI to provide online support for youth. Focusing on ethical and responsible AI…

  • Hacker News: Using GRPO to Beat o1, o3-mini and R1 at "Temporal Clue"

    Source URL: https://openpipe.ai/blog/using-grpo-to-beat-o1-o3-mini-and-r1-on-temporal-clue Source: Hacker News Title: Using GRPO to Beat o1, o3-mini and R1 at "Temporal Clue" Summary: The provided text explores the application of reinforcement learning to enhance the deductive reasoning capabilities of smaller, open-weight models in AI. Specifically, it focuses on…

  • Hacker News: Prompting Large Language Models in Bash Scripts

    Source URL: https://elijahpotter.dev/articles/prompting_large_language_models_in_bash_scripts Source: Hacker News Title: Prompting Large Language Models in Bash Scripts Summary: The text discusses the use of large language models (LLMs) in Bash scripts, specifically highlighting a tool called “ofc” that facilitates this integration. It explores innovative uses for LLMs in generating datasets…

  • Hacker News: The Dino, the Llama, and the Whale (Deno and Jupyter for Local AI Experiments)

    Source URL: https://deno.com/blog/the-dino-llama-and-whale Source: Hacker News Title: The Dino, the Llama, and the Whale (Deno and Jupyter for Local AI Experiments) Summary: The text outlines the author’s journey in experimenting with a locally hosted large language model (LLM) using various tools such as Deno, Jupyter Notebook, and…

  • Simon Willison’s Weblog: llm-anthropic #24: Use new URL parameter to send attachments

    Source URL: https://simonwillison.net/2025/Mar/1/llm-anthropic/#atom-everything Source: Simon Willison’s Weblog Title: llm-anthropic #24: Use new URL parameter to send attachments Feedly Summary: Anthropic released a neat quality of life improvement today. Alex Albert: We’ve added the ability to specify a public facing URL as the source for an image…