Tag: Deepseek v3
-
Simon Willison’s Weblog: Codestral 25.01
Source URL: https://simonwillison.net/2025/Jan/13/codestral-2501/ Source: Simon Willison’s Weblog Title: Codestral 25.01 Feedly Summary: Codestral 25.01 Brand new code-focused model from Mistral. Unlike the first Codestral this one isn’t (yet) available as open weights. The model has a 256k token context – a new record for Mistral. The new model scored an impressive joint first place with…
-
Hacker News: Notes on the New Deepseek v3
Source URL: https://composio.dev/blog/notes-on-new-deepseek-v3/ Source: Hacker News Title: Notes on the New Deepseek v3 Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the release of Deepseek’s v3 model, a 607B mixture-of-experts model that showcases exceptional performance, surpassing both open-source and proprietary competitors at a significantly lower training cost. It highlights the engineering…