evaluation techniques – Experimental News Clipping Site

Simon Willison’s Weblog: Exploring Promptfoo via Dave Guarino’s SNAP evals

Apr 24, 2025

—

by

Source URL: https://simonwillison.net/2025/Apr/24/exploring-promptfoo/#atom-everything Source: Simon Willison’s Weblog Title: Exploring Promptfoo via Dave Guarino’s SNAP evals Feedly Summary: I used part three (here’s parts one and two) of Dave Guarino’s series on evaluating how well LLMs can answer questions about SNAP (aka food stamps) as an excuse to explore Promptfoo, an LLM eval tool. SNAP (Supplemental…

The Register: Google to Iran: Yes, we see you using Gemini for phishing and scripting. We’re onto you

Jan 31, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.theregister.com/2025/01/31/state_spies_google_gemini/ Source: The Register Title: Google to Iran: Yes, we see you using Gemini for phishing and scripting. We’re onto you Feedly Summary: And you, China, Russia, North Korea … Guardrails block malware generation Google says it’s spotted Chinese, Russian, Iranian, and North Korean government agents using its Gemini AI for nefarious purposes,…

Cloud Blog: Supervised Fine Tuning for Gemini: A best practices guide

Jan 7, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/master-gemini-sft/ Source: Cloud Blog Title: Supervised Fine Tuning for Gemini: A best practices guide Feedly Summary: Foundation models such as Gemini have revolutionized how we work, but sometimes they need guidance to excel at specific business tasks. Perhaps their answers are too long, or their summaries miss the mark. That’s where supervised fine-tuning…

Hacker News: Sabotage Evaluations for Frontier Models

Oct 20, 2024

—

by

system automation

in Uncategorized

Source URL: https://www.anthropic.com/research/sabotage-evaluations Source: Hacker News Title: Sabotage Evaluations for Frontier Models Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text outlines a comprehensive series of evaluation techniques developed by the Anthropic Alignment Science team to assess potential sabotage capabilities in AI models. These evaluations are crucial for ensuring the safety and integrity…

Hacker News: LLMs know more than what they say

Aug 19, 2024

—

by

system automation

in Uncategorized

Source URL: https://arjunbansal.substack.com/p/llms-know-more-than-what-they-say Source: Hacker News Title: LLMs know more than what they say Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses advancements in evaluation techniques for generative AI applications, particularly focusing on reducing hallucination occurrences and improving evaluation accuracy through a method called Latent Space Readout (LSR). This approach demonstrates…

Tag: evaluation techniques

Simon Willison’s Weblog: Exploring Promptfoo via Dave Guarino’s SNAP evals

The Register: Google to Iran: Yes, we see you using Gemini for phishing and scripting. We’re onto you

Cloud Blog: Supervised Fine Tuning for Gemini: A best practices guide

Hacker News: Sabotage Evaluations for Frontier Models

Hacker News: LLMs know more than what they say