Tag: large language models
-
Hacker News: Task-Specific LLM Evals That Do and Don’t Work
Source URL: https://eugeneyan.com/writing/evals/ Source: Hacker News Title: Task-Specific LLM Evals That Do and Don’t Work Feedly Summary: Comments AI Summary and Description: Yes Summary: The text presents a comprehensive overview of evaluation metrics for machine learning tasks, specifically focusing on classification, summarization, and translation within the context of large language models (LLMs). It highlights the…
-
The Register: Microsoft dangles $10K for hackers to hijack LLM email service
Source URL: https://www.theregister.com/2024/12/09/microsoft_llm_prompt_injection_challenge/ Source: The Register Title: Microsoft dangles $10K for hackers to hijack LLM email service Feedly Summary: Outsmart an AI, win a little Christmas cash Microsoft and friends have challenged AI hackers to break a simulated LLM-integrated email client with a prompt injection attack – and the winning teams will share a $10,000…
-
Embrace The Red: Terminal DiLLMa: LLM-powered Apps Can Hijack Your Terminal Via Prompt Injection
Source URL: https://embracethered.com/blog/posts/2024/terminal-dillmas-prompt-injection-ansi-sequences/ Source: Embrace The Red Title: Terminal DiLLMa: LLM-powered Apps Can Hijack Your Terminal Via Prompt Injection Feedly Summary: Last week Leon Derczynski described how LLMs can output ANSI escape codes. These codes, also known as control characters, are interpreted by terminal emulators and modify behavior. This discovery resonates with areas I had…
-
Simon Willison’s Weblog: Roaming RAG – make the model find the answers
Source URL: https://simonwillison.net/2024/Dec/6/roaming-rag/#atom-everything Source: Simon Willison’s Weblog Title: Roaming RAG – make the model find the answers Feedly Summary: Roaming RAG – make the model find the answers Neat new RAG technique (with a snappy name) from John Berryman: The big idea of Roaming RAG is to craft a simple LLM application so that the…
-
Hacker News: AmpereOne: Cores Are the New MHz
Source URL: https://www.jeffgeerling.com/blog/2024/ampereone-cores-are-new-mhz Source: Hacker News Title: AmpereOne: Cores Are the New MHz Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text provides an in-depth examination of the Supermicro ARS-211ME-FNR server equipped with the 192-core AmpereOne A192-32X CPU, focusing on its design and performance metrics. The analysis highlights how advancements in core technology…
-
Simon Willison’s Weblog: New Pleias 1.0 LLMs trained exclusively on openly licensed data
Source URL: https://simonwillison.net/2024/Dec/5/pleias-llms/#atom-everything Source: Simon Willison’s Weblog Title: New Pleias 1.0 LLMs trained exclusively on openly licensed data Feedly Summary: New Pleias 1.0 LLMs trained exclusively on openly licensed data I wrote about the Common Corpus public domain dataset back in March. Now Pleias, the team behind Common Corpus, have released the first family of…