Experimental News Clipping Site

Tag: Llama3.1-70b

Simon Willison’s Weblog: Gemini Diffusion

May 21, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/May/21/gemini-diffusion/ Source: Simon Willison’s Weblog Title: Gemini Diffusion Feedly Summary: Gemini Diffusion Another of the announcements from Google I/O yesterday was Gemini Diffusion, Google’s first LLM to use diffusion (similar to image models like Imagen and Stable Diffusion) in place of transformers. Google describe it like this: Traditional autoregressive language models generate text…
Simon Willison’s Weblog: Cerebras Coder

Oct 31, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Oct/31/cerebras-coder/#atom-everything Source: Simon Willison’s Weblog Title: Cerebras Coder Feedly Summary: Cerebras Coder Val Town founder Steve Krouse has been building demos on top of the Cerebras API that runs Llama3.1-70b at 2,000 tokens/second. Having a capable LLM with that kind of performance turns out to be really interesting. Cerebras Coder is a demo…