Tag: embeddings

  • Hacker News: 400x faster embeddings models using static embeddings

    Source URL: https://huggingface.co/blog/static-embeddings Source: Hacker News Title: 400x faster embeddings models using static embeddings Feedly Summary: Comments AI Summary and Description: Yes **Summary:** This blog post discusses a new method to train static embedding models significantly faster than existing state-of-the-art models. These models are suited for various applications, including on-device and in-browser execution, and edge…

  • Hacker News: Don’t use cosine similarity carelessly

    Source URL: https://p.migdal.pl/blog/2025/01/dont-use-cosine-similarity/ Source: Hacker News Title: Don’t use cosine similarity carelessly Feedly Summary: Comments AI Summary and Description: Yes Summary: The text explores the complexities and limitations of using cosine similarity in AI, particularly in the context of vector embeddings derived from language models. It critiques the blind application of cosine similarity to assess…

  • Hacker News: voyage-code-3

    Source URL: https://blog.voyageai.com/2024/12/04/voyage-code-3/ Source: Hacker News Title: voyage-code-3 Feedly Summary: Comments AI Summary and Description: Yes Summary: The text presents voyage-code-3, a new embedding model optimized for code retrieval that significantly outperforms existing models in both performance and cost-efficiency. The introduction of Matryoshka learning and advanced quantization techniques allows for reduced storage requirements without compromising…

  • Hacker News: Large Concept Models: Language modeling in a sentence representation space

    Source URL: https://github.com/facebookresearch/large_concept_model Source: Hacker News Title: Large Concept Models: Language modeling in a sentence representation space Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the implementation and experiments related to Large Concept Models (LCMs) as part of language modeling in a semantic representation space. By utilizing SONAR embeddings for multiple…

  • Simon Willison’s Weblog: deepseek-ai/DeepSeek-V3-Base

    Source URL: https://simonwillison.net/2024/Dec/25/deepseek-v3/#atom-everything Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-V3-Base Feedly Summary: deepseek-ai/DeepSeek-V3-Base No model card or announcement yet, but this new model release from Chinese AI lab DeepSeek (an arm of Chinese hedge fund High-Flyer) looks very significant. It’s a huge model – 685B parameters, 687.9 GB on disk (TIL how to size a git-lfs…