Tag: generative
-
Simon Willison’s Weblog: Anomalous Tokens in DeepSeek-V3 and r1
Source URL: https://simonwillison.net/2025/Jan/26/anomalous-tokens-in-deepseek-v3-and-r1/#atom-everything Source: Simon Willison’s Weblog Title: Anomalous Tokens in DeepSeek-V3 and r1 Feedly Summary: Anomalous Tokens in DeepSeek-V3 and r1 Glitch tokens (previously) are tokens or strings that trigger strange behavior in LLMs, hinting at oddities in their tokenizers or model weights. Here’s a fun exploration of them across DeepSeek v3 and R1.…
-
Simon Willison’s Weblog: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens
Source URL: https://simonwillison.net/2025/Jan/26/qwen25-1m/ Source: Simon Willison’s Weblog Title: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens Feedly Summary: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens Very significant new release from Alibaba’s Qwen team. Their openly licensed (sometimes Apache 2, sometimes Qwen license, I’ve had trouble keeping…
-
Simon Willison’s Weblog: ChatGPT Operator system prompt
Source URL: https://simonwillison.net/2025/Jan/26/chatgpt-operator-system-prompt/#atom-everything Source: Simon Willison’s Weblog Title: ChatGPT Operator system prompt Feedly Summary: ChatGPT Operator system prompt Johann Rehberger snagged a copy of the ChatGPT Operator system prompt. As usual, the system prompt doubles as better written documentation than any of the official sources. It asks users for confirmation a lot: ## Confirmations Ask…
-
Hacker News: Why Your AI Product Team Needs an AI Quality Lead
Source URL: https://freeplay.ai/blog/why-your-ai-product-team-needs-an-ai-quality-lead Source: Hacker News Title: Why Your AI Product Team Needs an AI Quality Lead Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the establishment of the “AI Quality Lead” role at Help Scout, highlighting its importance in enhancing AI team’s effectiveness and product quality through domain expertise combined…
-
The Register: What happens when we can’t just build bigger AI datacenters anymore?
Source URL: https://www.theregister.com/2025/01/24/build_bigger_ai_datacenters/ Source: The Register Title: What happens when we can’t just build bigger AI datacenters anymore? Feedly Summary: We stitch together enormous supercomputers from other smaller supercomputers of course Feature Generative AI models have not only exploded in popularity over the past two years, but they’ve also grown at a precipitous rate, necessitating…
-
Cloud Blog: Announcing smaller machine types for A3 High VMs
Source URL: https://cloud.google.com/blog/products/compute/announcing-smaller-machine-types-for-a3-high-vms/ Source: Cloud Blog Title: Announcing smaller machine types for A3 High VMs Feedly Summary: Today, an increasing number of organizations are using GPUs to run inference1 on their AI/ML models. Since the number of GPUs needed to serve a single inference workload varies, organizations need more granularity in the number of GPUs…
-
Simon Willison’s Weblog: Anthropic’s new Citations API
Source URL: https://simonwillison.net/2025/Jan/24/anthropics-new-citations-api/#atom-everything Source: Simon Willison’s Weblog Title: Anthropic’s new Citations API Feedly Summary: Here’s a new API-only feature from Anthropic that requires quite a bit of assembly in order to unlock the value: Introducing Citations on the Anthropic API. Let’s talk about what this is and why it’s interesting. Citations for Retrieval Augmented Generation…
-
Simon Willison’s Weblog: Quoting Dustin Ewers
Source URL: https://simonwillison.net/2025/Jan/24/dustin-ewers/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Dustin Ewers Feedly Summary: AI tools create a significant productivity boost for developers. Different folks report different gains, but most people who try AI code generation recognize its ability to increase velocity. Many people think that means we’re going to need fewer developers, and our industry…