Tag: transformers

  • Simon Willison’s Weblog: AbsenceBench: Language Models Can’t Tell What’s Missing

    Source URL: https://simonwillison.net/2025/Jun/20/absencebench/#atom-everything
    Source: Simon Willison’s Weblog
    Title: AbsenceBench: Language Models Can’t Tell What’s Missing
    Feedly Summary: Here’s another interesting result to file under the “jagged frontier” of LLMs, where their strengths and weaknesses are often unintuitive. Long context models have been getting increasingly good at passing “Needle…

  • Simon Willison’s Weblog: Qwen3 Embedding

    Source URL: https://simonwillison.net/2025/Jun/8/qwen3-embedding/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Qwen3 Embedding
    Feedly Summary: New family of embedding models from Qwen, in three sizes (0.6B, 4B, 8B) and two categories: Text Embedding and Text Reranking. The full collection can be browsed on Hugging Face. The smallest available model is the 0.6B Q8 one, which…

  • CSA: Exploiting Trusted AI: GPTs in Cyberattacks

    Source URL: https://abnormal.ai/blog/how-attackers-exploit-trusted-ai-tools
    Source: CSA
    Title: Exploiting Trusted AI: GPTs in Cyberattacks
    AI Summary and Description: Yes
    Summary: The text discusses the emergence of malicious AI, particularly focusing on how generative pre-trained transformers (GPTs) are being exploited by cybercriminals. It highlights the potential risks posed by these technologies, including sophisticated fraud tactics and…

  • Simon Willison’s Weblog: Gemini Diffusion

    Source URL: https://simonwillison.net/2025/May/21/gemini-diffusion/
    Source: Simon Willison’s Weblog
    Title: Gemini Diffusion
    Feedly Summary: Another of the announcements from Google I/O yesterday was Gemini Diffusion, Google’s first LLM to use diffusion (similar to image models like Imagen and Stable Diffusion) in place of transformers. Google describe it like this: Traditional autoregressive language models generate text…

  • Slashdot: DeepMind is Holding Back Release of AI Research To Give Google an Edge

    Source URL: https://tech.slashdot.org/story/25/04/01/1648250/deepmind-is-holding-back-release-of-ai-research-to-give-google-an-edge?utm_source=rss1.0mainlinkanon&utm_medium=feed
    Source: Slashdot
    Title: DeepMind is Holding Back Release of AI Research To Give Google an Edge
    AI Summary and Description: Yes
    Summary: Google DeepMind has altered its publication policy to maintain a competitive advantage in AI research, introducing stringent vetting processes that hinder the release of studies. This shift, aimed…

  • Simon Willison’s Weblog: Nomic Embed Code: A State-of-the-Art Code Retriever

    Source URL: https://simonwillison.net/2025/Mar/27/nomic-embed-code/
    Source: Simon Willison’s Weblog
    Title: Nomic Embed Code: A State-of-the-Art Code Retriever
    Feedly Summary: Nomic have released a new embedding model that specializes in code, based on their CoRNStack “large-scale high-quality training dataset specifically curated for code retrieval”. The nomic-embed-code model is pretty large –…

  • Hacker News: StarVector: Generating Scalable Vector Graphics Code from Images and Text

    Source URL: https://starvector.github.io/
    Source: Hacker News
    Title: StarVector: Generating Scalable Vector Graphics Code from Images and Text
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text details the functionalities and performance of the StarVector models, specifically in generating SVG code from images. It outlines the model’s superiority in translating complex visual elements into…

  • Hacker News: What went wrong with the Alan Turing Institute?

    Source URL: https://www.chalmermagne.com/p/how-not-to-build-an-ai-institute
    Source: Hacker News
    Title: What went wrong with the Alan Turing Institute?
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text discusses the struggles and criticisms facing the Alan Turing Institute (ATI) in the UK, particularly its failure to adapt to advances in AI, such as generative AI and large…