Tag: safetensors
-
Cloud Blog: Scalable AI starts with storage: Guide to model artifact strategies
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/scalable-ai-starts-with-storage-guide-to-model-artifact-strategies/ Source: Cloud Blog Title: Scalable AI starts with storage: Guide to model artifact strategies Feedly Summary: Managing large model artifacts is a common bottleneck in MLOps. Baking models into container images leads to slow, monolithic deployments, and downloading them at startup introduces significant delays. This guide explores a better way: decoupling your…
-
Simon Willison’s Weblog: Comma v0.1 1T and 2T – 7B LLMs trained on openly licensed text
Source URL: https://simonwillison.net/2025/Jun/7/comma/#atom-everything Source: Simon Willison’s Weblog Title: Comma v0.1 1T and 2T – 7B LLMs trained on openly licensed text Feedly Summary: It’s been a long time coming, but we finally have some promising LLMs to try out which are trained entirely on openly licensed text! EleutherAI released the Pile four and a half…
-
Simon Willison’s Weblog: deepseek-ai/DeepSeek-V3-0324
Source URL: https://simonwillison.net/2025/Mar/24/deepseek/ Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-V3-0324 Feedly Summary: deepseek-ai/DeepSeek-V3-0324 Chinese AI lab DeepSeek just released the latest version of their enormous DeepSeek v3 model, baking the release date into the name DeepSeek-V3-0324. The license is MIT, the README is empty and the release adds up a to a total of 641 GB…
-
Simon Willison’s Weblog: deepseek-ai/DeepSeek-V3-0324
Source URL: https://simonwillison.net/2025/Mar/24/deepseek/ Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-V3-0324 Feedly Summary: deepseek-ai/DeepSeek-V3-0324 Chinese AI lab DeepSeek just released the latest version of their enormous DeepSeek v3 model, baking the release date into the name DeepSeek-V3-0324. The license is MIT, the README is empty and the release adds up a to a total of 641 GB…
-
Hacker News: Transposing Tensor Files
Source URL: https://mmapped.blog/posts/33-transposing-tensor-files.html Source: Hacker News Title: Transposing Tensor Files Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the design and functionality of machine learning serialization formats, focusing on the challenges of the ONNX format and introducing an improved alternative called “tensorsafe”. Specifically, it highlights how tensorsafe resolves issues found in…