Tag: batch processing
-
Simon Willison’s Weblog: Anthropic: Message Batches (beta)
Source URL: https://simonwillison.net/2024/Oct/8/anthropic-batch-mode/ Source: Simon Willison’s Weblog Title: Anthropic: Message Batches (beta) Feedly Summary: Anthropic: Message Batches (beta) Anthropic now have a batch mode, allowing you to send prompts to Claude in batches which will be processed within 24 hours (though probably much faster than that) and come at a 50% price discount. This matches…
-
The Register: Cerebras gives waferscale chips inferencing twist, claims 1,800 token per sec generation rates
Source URL: https://www.theregister.com/2024/08/27/cerebras_ai_inference/ Source: The Register Title: Cerebras gives waferscale chips inferencing twist, claims 1,800 token per sec generation rates Feedly Summary: Faster than you can read? More like blink and you’ll miss the hallucination Hot Chips Inference performance in many modern generative AI workloads is usually a function of memory bandwidth rather than compute.…