Tag: Cache

  • Simon Willison’s Weblog: Gemini 2.5 Models now support implicit caching

    Source URL: https://simonwillison.net/2025/May/9/gemini-implicit-caching/#atom-everything Source: Simon Willison’s Weblog Title: Gemini 2.5 Models now support implicit caching Feedly Summary: Gemini 2.5 Models now support implicit caching I just spotted a cacheTokensDetails key in the token usage JSON while running a long chain of prompts against Gemini 2.5 Flash – despite not configuring caching myself: {“cachedContentTokenCount": 200658, "promptTokensDetails":…

  • Simon Willison’s Weblog: Qwen3-8B

    Source URL: https://simonwillison.net/2025/May/2/qwen3-8b/#atom-everything Source: Simon Willison’s Weblog Title: Qwen3-8B Feedly Summary: Having tried a few of the Qwen 3 models now my favorite is a bit of a surprise to me: I’m really enjoying Qwen3-8B. I’ve been running prompts through the MLX 4bit quantized version, mlx-community/Qwen3-8B-4bit. I’m using llm-mlx like this: llm install llm-mlx llm…

  • Simon Willison’s Weblog: Exploring Promptfoo via Dave Guarino’s SNAP evals

    Source URL: https://simonwillison.net/2025/Apr/24/exploring-promptfoo/#atom-everything Source: Simon Willison’s Weblog Title: Exploring Promptfoo via Dave Guarino’s SNAP evals Feedly Summary: I used part three (here’s parts one and two) of Dave Guarino’s series on evaluating how well LLMs can answer questions about SNAP (aka food stamps) as an excuse to explore Promptfoo, an LLM eval tool. SNAP (Supplemental…

  • The Register: Vector search is the new black for enterprise databases

    Source URL: https://www.theregister.com/2025/04/24/database_vector_search/ Source: The Register Title: Vector search is the new black for enterprise databases Feedly Summary: Software slingers from Redis to Teradata are bolting on smarts to stay relevant in GenAI era About two years ago, popular cache database Redis was among a wave of vendors that added vector search capabilities to their…

  • AWS News Blog: New Amazon EC2 Graviton4-based instances with NVMe SSD storage

    Source URL: https://aws.amazon.com/blogs/aws/new-amazon-ec2-graviton4-based-instances-with-nvme-ssd-storage/ Source: AWS News Blog Title: New Amazon EC2 Graviton4-based instances with NVMe SSD storage Feedly Summary: AWS introduces new EC2 instance families (C8gd, M8gd, R8gd) powered by Graviton4 processors with NVMe SSD storage, offering up to 30% better performance, 3x more vCPUs and memory, and up to 11.4TB local storage compared to…

  • Cloud Blog: Supercharge your data the open-source way: Memorystore for Valkey is now GA

    Source URL: https://cloud.google.com/blog/products/databases/announcing-general-availability-of-memorystore-for-valkey/ Source: Cloud Blog Title: Supercharge your data the open-source way: Memorystore for Valkey is now GA Feedly Summary: Editor’s note: Ping Xie is a Valkey maintainer on the Valkey Technical Steering Committee (TSC). Memorystore, Google Cloud’s fully managed in-memory service for Valkey, Redis and Memcached, plays an increasingly important role in our…

  • The Register: Return of Redis creator bears fruit with vector set data type

    Source URL: https://www.theregister.com/2025/04/10/return_of_redis_creator/ Source: The Register Title: Return of Redis creator bears fruit with vector set data type Feedly Summary: LLM query caching also lands soon The return of Redis creator Salvatore Sanfilippo has borne fruit in the form of a new data type – vector sets – for the widely used cache-turned-multi-model database.… AI…