Tag: llm

  • Slashdot: Can You Run the Llama 2 LLM on DOS?

    Source URL: https://tech.slashdot.org/story/25/04/21/0026255/can-you-run-the-llama-2-llm-on-dos Source: Slashdot Title: Can You Run the Llama 2 LLM on DOS? Feedly Summary: AI Summary and Description: Yes Summary: The text revolves around an innovative project by an embedded security researcher who successfully ported Llama 2, a large language model (LLM), to run on vintage DOS machines. This challenges the conventional…

  • Simon Willison’s Weblog: Quoting Ethan Mollick

    Source URL: https://simonwillison.net/2025/Apr/20/ethan-mollick/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Ethan Mollick Feedly Summary: In some tasks, AI is unreliable. In others, it is superhuman. You could, of course, say the same thing about calculators, but it is also clear that AI is different. It is already demonstrating general capabilities and performing a wide range of…

  • Simon Willison’s Weblog: Note on 20th April 2025

    Source URL: https://simonwillison.net/2025/Apr/20/janky-license/#atom-everything Source: Simon Willison’s Weblog Title: Note on 20th April 2025 Feedly Summary: Now that Llama has very real competition in open weight models (Gemma 3, latest Mistrals, DeepSeek, Qwen) I think their janky license is becoming much more of a liability for them. It’s just limiting enough that it could be the…

  • Simon Willison’s Weblog: llm-fragments-github 0.2

    Source URL: https://simonwillison.net/2025/Apr/20/llm-fragments-github/#atom-everything Source: Simon Willison’s Weblog Title: llm-fragments-github 0.2 Feedly Summary: llm-fragments-github 0.2 I upgraded my llm-fragments-github plugin to add a new fragment type called issue. It lets you pull the entire content of a GitHub issue thread into your prompt as a concatenated Markdown file. (If you haven’t seen fragments before I introduced…

  • Simon Willison’s Weblog: Maybe Meta’s Llama claims to be open source because of the EU AI act

    Source URL: https://simonwillison.net/2025/Apr/19/llama-eu-ai-act/#atom-everything Source: Simon Willison’s Weblog Title: Maybe Meta’s Llama claims to be open source because of the EU AI act Feedly Summary: I encountered a theory a while ago that one of the reasons Meta insist on using the term “open source” for their Llama models despite the Llama license not actually conforming…

  • Simon Willison’s Weblog: Claude Code: Best practices for agentic coding

    Source URL: https://simonwillison.net/2025/Apr/19/claude-code-best-practices/#atom-everything Source: Simon Willison’s Weblog Title: Claude Code: Best practices for agentic coding Feedly Summary: Claude Code: Best practices for agentic coding Extensive new documentation from Anthropic on how to get the best results out of their Claude Code CLI coding agent tool, which includes this fascinating tip: We recommend using the word…

  • Simon Willison’s Weblog: Gemma 3 QAT Models

    Source URL: https://simonwillison.net/2025/Apr/19/gemma-3-qat-models/ Source: Simon Willison’s Weblog Title: Gemma 3 QAT Models Feedly Summary: Gemma 3 QAT Models Interesting release from Google, as a follow-up to Gemma 3 from last month: To make Gemma 3 even more accessible, we are announcing new versions optimized with Quantization-Aware Training (QAT) that dramatically reduces memory requirements while maintaining…

  • Simon Willison’s Weblog: Quoting Andrew Ng

    Source URL: https://simonwillison.net/2025/Apr/18/andrew-ng/ Source: Simon Willison’s Weblog Title: Quoting Andrew Ng Feedly Summary: To me, a successful eval meets the following criteria. Say, we currently have system A, and we might tweak it to get a system B: If A works significantly better than B according to a skilled human judge, the eval should give…

  • Simon Willison’s Weblog: Image segmentation using Gemini 2.5

    Source URL: https://simonwillison.net/2025/Apr/18/gemini-image-segmentation/ Source: Simon Willison’s Weblog Title: Image segmentation using Gemini 2.5 Feedly Summary: Max Woolf pointed out this new feature of the Gemini 2.5 series in a comment on Hacker News: One hidden note from Gemini 2.5 Flash when diving deep into the documentation: for image inputs, not only can the model be…