Tag: tokens
-
Simon Willison’s Weblog: Claude 3.5 Haiku price drops by 20%
Source URL: https://simonwillison.net/2024/Dec/5/claude-35-haiku-price-drops-by-20/#atom-everything Source: Simon Willison’s Weblog Title: Claude 3.5 Haiku price drops by 20% Feedly Summary: Buried in this otherwise quite dry post about Anthropic’s ongoing partnership with AWS: To make this model even more accessible for a wide range of use cases, we’re lowering the price…
-
Simon Willison’s Weblog: datasette-llm-usage
Source URL: https://simonwillison.net/2024/Dec/2/datasette-llm-usage/ Source: Simon Willison’s Weblog Title: datasette-llm-usage Feedly Summary: I released the first alpha of a Datasette plugin to help track LLM usage by other plugins, with the goal of supporting token allowances – both for things like free public apps that stop working after a daily allowance, plus free previews of…
-
Simon Willison’s Weblog: LLM 0.19
Source URL: https://simonwillison.net/2024/Dec/1/llm-019/ Source: Simon Willison’s Weblog Title: LLM 0.19 Feedly Summary: I just released version 0.19 of LLM, my Python library and CLI utility for working with Large Language Models. I released 0.18 a couple of weeks ago adding support for calling models from Python asyncio code. 0.19 improves on that, and…
-
Hacker News: How We Optimize LLM Inference for AI Coding Assistant
Source URL: https://www.augmentcode.com/blog/rethinking-llm-inference-why-developer-ai-needs-a-different-approach? Source: Hacker News Title: How We Optimize LLM Inference for AI Coding Assistant Feedly Summary: The text discusses the challenges and optimization strategies employed by Augment to improve large language model (LLM) inference specifically for coding tasks. It highlights the importance of providing full codebase…
-
Embrace The Red: DeepSeek AI: From Prompt Injection To Account Takeover
Source URL: https://embracethered.com/blog/posts/2024/deepseek-ai-prompt-injection-to-xss-and-account-takeover/ Source: Embrace The Red Title: DeepSeek AI: From Prompt Injection To Account Takeover Feedly Summary: About two weeks ago, DeepSeek released a new AI reasoning model, DeepSeek-R1-Lite. The news quickly gained attention and interest across the AI community due to the reasoning capabilities the Chinese lab announced. However, whenever there is a…
-
Simon Willison’s Weblog: SmolVLM – small yet mighty Vision Language Model
Source URL: https://simonwillison.net/2024/Nov/28/smolvlm/#atom-everything Source: Simon Willison’s Weblog Title: SmolVLM – small yet mighty Vision Language Model Feedly Summary: I’ve been having fun playing with this new vision model from the Hugging Face team behind SmolLM. They describe it as: […] a 2B VLM, SOTA for its memory…