Tag: transformers
-
Hacker News: Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Source URL: https://arxiv.org/abs/2402.12875 Source: Hacker News Title: Chain of Thought Empowers Transformers to Solve Inherently Serial Problems Feedly Summary: Comments AI Summary and Description: Yes Summary: The paper discusses the concept of Chain of Thought (CoT) applied to large language models (LLMs), demonstrating how it enhances their capabilities, particularly in arithmetic and symbolic reasoning tasks.…
-
Simon Willison’s Weblog: SQL injection-like attack on LLMs with special tokens
Source URL: https://simonwillison.net/2024/Aug/20/sql-injection-like-attack-on-llms-with-special-tokens/#atom-everything Source: Simon Willison’s Weblog Title: SQL injection-like attack on LLMs with special tokens Feedly Summary: SQL injection-like attack on LLMs with special tokens Andrej Karpathy explains something that’s been confusing me for the best part of a year: The decision by LLM tokenizers to parse special tokens in the input string (,…