Tag: llms

  • Simon Willison’s Weblog: Pixtral Large

    Source URL: https://simonwillison.net/2024/Nov/18/pixtral-large/ Source: Simon Willison’s Weblog Title: Pixtral Large Feedly Summary: Pixtral Large New today from Mistral: Today we announce Pixtral Large, a 124B open-weights multimodal model built on top of Mistral Large 2. Pixtral Large is the second model in our multimodal family and demonstrates frontier-level image understanding. The weights are out on…

  • Simon Willison’s Weblog: Qwen: Extending the Context Length to 1M Tokens

    Source URL: https://simonwillison.net/2024/Nov/18/qwen-turbo/#atom-everything Source: Simon Willison’s Weblog Title: Qwen: Extending the Context Length to 1M Tokens Feedly Summary: Qwen: Extending the Context Length to 1M Tokens The new Qwen2.5-Turbo boasts a million token context window (up from 128,000 for Qwen 2.5) and faster performance: Using sparse attention mechanisms, we successfully reduced the time to first…

  • Simon Willison’s Weblog: Quoting Jack Clark

    Source URL: https://simonwillison.net/2024/Nov/18/jack-clark/ Source: Simon Willison’s Weblog Title: Quoting Jack Clark Feedly Summary: The main innovation here is just using more data. Specifically, Qwen2.5 Coder is a continuation of an earlier Qwen 2.5 model. The original Qwen 2.5 model was trained on 18 trillion tokens spread across a variety of languages and tasks (e.g., writing,…

  • CSA: 9 Tips to Improve Unstructured Data Security

    Source URL: https://cloudsecurityalliance.org/articles/9-tips-to-simplify-and-improve-unstructured-data-security Source: CSA Title: 9 Tips to Improve Unstructured Data Security Feedly Summary: AI Summary and Description: Yes Summary: The text outlines significant strategies for managing and securing unstructured data, based on a 2024 Gartner report. These strategies focus on leveraging Data Access Governance and Data Discovery tools, adapting to the changing landscape…

  • Hacker News: Ask HN: Is there any license that is designed to exclude LLMs?

    Source URL: https://news.ycombinator.com/item?id=42170746 Source: Hacker News Title: Ask HN: Is there any license that is designed to exclude LLMs? Feedly Summary: Comments AI Summary and Description: Yes Summary: The text expresses concerns about content harvesting by LLMs (Large Language Models) and discusses potential licensing solutions, highlighting the struggle to protect digital content. The insights are…

  • Simon Willison’s Weblog: llm-gemini 0.4

    Source URL: https://simonwillison.net/2024/Nov/18/llm-gemini-04/#atom-everything Source: Simon Willison’s Weblog Title: llm-gemini 0.4 Feedly Summary: llm-gemini 0.4 New release of my llm-gemini plugin, adding support for asynchronous models (see LLM 0.18), plus the new gemini-exp-1114 model (currently at the top of the Chatbot Arena) and a -o json_object 1 option to force JSON output. I also released llm-claude-3…

  • Hacker News: Why LLMs Within Software Development May Be a Dead End

    Source URL: https://thenewstack.io/why-llms-within-software-development-may-be-a-dead-end/ Source: Hacker News Title: Why LLMs Within Software Development May Be a Dead End Feedly Summary: Comments AI Summary and Description: Yes Summary: The text provides a critical perspective on the limitations of current Large Language Models (LLMs) regarding their composability, explainability, and security implications for software development. It argues that LLMs…

  • Simon Willison’s Weblog: LLM 0.18

    Source URL: https://simonwillison.net/2024/Nov/17/llm-018/#atom-everything Source: Simon Willison’s Weblog Title: LLM 0.18 Feedly Summary: LLM 0.18 New release of LLM. The big new feature is asynchronous model support – you can now use supported models in async Python code like this: import llm model = llm.get_async_model("gpt-4o") async for chunk in model.prompt( "Five surprising names for a pet…

  • Hacker News: Garak, LLM Vulnerability Scanner

    Source URL: https://github.com/NVIDIA/garak Source: Hacker News Title: Garak, LLM Vulnerability Scanner Feedly Summary: Comments AI Summary and Description: Yes Summary: The text describes “garak,” a command-line vulnerability scanner specifically designed for large language models (LLMs). This tool aims to uncover various weaknesses in LLMs, such as hallucination, prompt injection attacks, and data leakage. Its development…