Tag: code retrieval

  • Simon Willison’s Weblog: Nomic Embed Code: A State-of-the-Art Code Retriever

    Source URL: https://simonwillison.net/2025/Mar/27/nomic-embed-code/
    Source: Simon Willison’s Weblog
    Title: Nomic Embed Code: A State-of-the-Art Code Retriever
    Feedly Summary: Nomic have released a new embedding model that specializes in code, based on their CoRNStack “large-scale high-quality training dataset specifically curated for code retrieval”. The nomic-embed-code model is pretty large –…

  • Hacker News: SOTA Code Retrieval with Efficient Code Embedding Models

    Source URL: https://www.qodo.ai/blog/qodo-embed-1-code-embedding-code-retreival/
    Source: Hacker News
    Title: SOTA Code Retrieval with Efficient Code Embedding Models
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text introduces Qodo-Embed-1, a new family of code embedding models that outperforms larger models in code retrieval tasks while maintaining a smaller footprint. It emphasizes the challenges existing models face…

  • Hacker News: Evaluating Code Embedding Models

    Source URL: https://blog.voyageai.com/2024/12/04/code-retrieval-eval/
    Source: Hacker News
    Title: Evaluating Code Embedding Models
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text discusses the challenges and limitations within the field of code retrieval, particularly as it pertains to embedding models used in coding assistants. It highlights the need for high-quality benchmarking datasets, identifies typical subtasks…

  • Hacker News: voyage-code-3

    Source URL: https://blog.voyageai.com/2024/12/04/voyage-code-3/
    Source: Hacker News
    Title: voyage-code-3
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text presents voyage-code-3, a new embedding model optimized for code retrieval that significantly outperforms existing models in both performance and cost-efficiency. The introduction of Matryoshka learning and advanced quantization techniques allows for reduced storage requirements without compromising…
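
The voyage-code-3 summary above credits Matryoshka learning and quantization with reducing storage. As a rough illustration of why those two ideas shrink an index, the sketch below truncates an embedding to its leading dimensions (the usual way Matryoshka-trained vectors are shortened) and then keeps one sign bit per dimension. It is a minimal sketch, not the Voyage AI implementation: the function names, the toy 8-dimensional vector, and the 1024 and 256 dimension figures are all assumptions chosen for illustration.

```python
import math


def truncate_and_normalize(vec, dim):
    """Keep the first `dim` components and rescale the result to unit length."""
    head = list(vec[:dim])
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head  # nothing to rescale
    return [x / norm for x in head]


def binarize(vec):
    """Binary quantization: 1 if the component is positive, else 0."""
    return [1 if x > 0 else 0 for x in vec]


def storage_bytes(dim, bits_per_dim):
    """Bytes needed to store one vector at the given precision."""
    return dim * bits_per_dim // 8


# Toy 8-dimensional embedding (hypothetical values, not model output).
full = [0.5, -0.5, 0.5, -0.5, 0.1, -0.1, 0.1, -0.1]

short = truncate_and_normalize(full, 4)  # shortened, unit-length vector
bits = binarize(short)                   # one bit per remaining dimension

# Illustrative sizes: float32 at 1024 dimensions vs. 1 bit at 256 dimensions.
full_size = storage_bytes(1024, 32)
small_size = storage_bytes(256, 1)
print(bits, full_size, small_size)
```

Truncation only preserves retrieval quality when the model was trained so that its leading dimensions carry the most information; applying it to an arbitrary embedding model is not expected to work as well.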