experimentation – Page 9 – Experimental News Clipping Site

Simon Willison’s Weblog: Mistral Small 3.1

Mar 17, 2025

—

by

Source URL: https://simonwillison.net/2025/Mar/17/mistral-small-31/#atom-everything Source: Simon Willison’s Weblog Title: Mistral Small 3.1 Feedly Summary: Mistral Small 3.1 Mistral Small 3 came out in January and was a notable, genuinely excellent local model that used an Apache 2.0 license. Mistral Small 3.1 offers a significant improvement: it’s multi-modal (images) and has an increased 128,000 token context length,…

Simon Willison’s Weblog: mlx-community/OLMo-2-0325-32B-Instruct-4bit

Mar 16, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Mar/16/olmo2/#atom-everything Source: Simon Willison’s Weblog Title: mlx-community/OLMo-2-0325-32B-Instruct-4bit Feedly Summary: mlx-community/OLMo-2-0325-32B-Instruct-4bit OLMo 2 32B claims to be “the first fully-open model (all data, code, weights, and details are freely available) to outperform GPT3.5-Turbo and GPT-4o mini". Thanks to the MLX project here’s a recipe that worked for me to run it on my Mac,…

Simon Willison’s Weblog: Notes on Google’s Gemma 3

Mar 12, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Mar/12/notes-on-googles-gemma-3/ Source: Simon Willison’s Weblog Title: Notes on Google’s Gemma 3 Feedly Summary: Google’s Gemma team released an impressive new model today (under their not-open-source Gemma license). Gemma 3 comes in four sizes – 1B, 4B, 12B, and 27B – and while 1B is text-only the larger three models are all multi-modal for…

Simon Willison’s Weblog: Here’s how I use LLMs to help me write code

Mar 11, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Mar/11/using-llms-for-code/ Source: Simon Willison’s Weblog Title: Here’s how I use LLMs to help me write code Feedly Summary: Online discussions about using Large Language Models to help write code inevitably produce comments from developers who’s experiences have been disappointing. They often ask what they’re doing wrong – how come some people are reporting…

New York Times – Artificial Intelligence : Lila Sciences Uses A.I. to Turbocharge Scientific Discovery

Mar 10, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.nytimes.com/2025/03/10/technology/ai-science-lab-lila.html Source: New York Times – Artificial Intelligence Title: Lila Sciences Uses A.I. to Turbocharge Scientific Discovery Feedly Summary: An ambitious start-up embodies new optimism that artificial intelligence can turbocharge scientific discovery. AI Summary and Description: Yes Summary: The text highlights the transformative role of artificial intelligence (AI) in scientific discovery and development,…

Hacker News: Koko (YC W22) Is Hiring a CTO / Lead Engineer

Mar 6, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.ycombinator.com/companies/koko-2/jobs/oPgy08B-lead-engineer-cto Source: Hacker News Title: Koko (YC W22) Is Hiring a CTO / Lead Engineer Feedly Summary: Comments AI Summary and Description: Yes Summary: The text outlines the mission and operational framework of Koko, a mental health tech nonprofit employing AI to provide online support for youth. Focusing on ethical and responsible AI…

Hacker News: Using GRPO to Beat o1, o3-mini and R1 at "Temporal Clue"

Mar 6, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://openpipe.ai/blog/using-grpo-to-beat-o1-o3-mini-and-r1-on-temporal-clue Source: Hacker News Title: Using GRPO to Beat o1, o3-mini and R1 at "Temporal Clue" Feedly Summary: Comments AI Summary and Description: Yes Short Summary with Insight: The provided text explores the application of reinforcement learning to enhance the deductive reasoning capabilities of smaller, open-weight models in AI. Specifically, it focuses on…

Hacker News: Prompting Large Language Models in Bash Scripts

Mar 2, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://elijahpotter.dev/articles/prompting_large_language_models_in_bash_scripts Source: Hacker News Title: Prompting Large Language Models in Bash Scripts Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the use of large language models (LLMs) in bash scripts, specifically highlighting a tool called “ofc” that facilitates this integration. It explores innovative uses for LLMs in generating datasets…

Hacker News: The Dino, the Llama, and the Whale (Deno and Jupyter for Local AI Experiments)

Mar 1, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://deno.com/blog/the-dino-llama-and-whale Source: Hacker News Title: The Dino, the Llama, and the Whale (Deno and Jupyter for Local AI Experiments) Feedly Summary: Comments AI Summary and Description: Yes Summary: The text outlines the author’s journey in experimenting with a locally hosted large language model (LLM) using various tools such as Deno, Jupyter Notebook, and…

Simon Willison’s Weblog: llm-anthropic #24: Use new URL parameter to send attachments

Mar 1, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Mar/1/llm-anthropic/#atom-everything Source: Simon Willison’s Weblog Title: llm-anthropic #24: Use new URL parameter to send attachments Feedly Summary: llm-anthropic #24: Use new URL parameter to send attachments Anthropic released a neat quality of life improvement today. Alex Albert: We’ve added the ability to specify a public facing URL as the source for an image…

Tag: experimentation