Tag: Well
-
Cisco Talos Blog: Seasoning email threats with hidden text salting
Source URL: https://blog.talosintelligence.com/seasoning-email-threats-with-hidden-text-salting/ Source: Cisco Talos Blog Title: Seasoning email threats with hidden text salting Feedly Summary: Hidden text salting is a simple yet effective technique for bypassing email parsers, confusing spam filters, and evading detection engines that rely on keywords. Cisco Talos observed an increase in the number of email threats leveraging hidden text…
-
Simon Willison’s Weblog: Anthropic’s new Citations API
Source URL: https://simonwillison.net/2025/Jan/24/anthropics-new-citations-api/#atom-everything Source: Simon Willison’s Weblog Title: Anthropic’s new Citations API Feedly Summary: Here’s a new API-only feature from Anthropic that requires quite a bit of assembly in order to unlock the value: Introducing Citations on the Anthropic API. Let’s talk about what this is and why it’s interesting. Citations for Retrieval Augmented Generation…
-
The Register: Mental toll: Scale AI, Outlier sued by humans paid to steer AI away from our darkest depths
Source URL: https://www.theregister.com/2025/01/24/scale_ai_outlier_sued_over/ Source: The Register Title: Mental toll: Scale AI, Outlier sued by humans paid to steer AI away from our darkest depths Feedly Summary: Who guards the guardrail makers? Not the bosses who hire them, it’s alleged Scale AI, which labels training data for machine-learning models, was sued this month, alongside labor platform…
-
OpenAI : Operator System Card
Source URL: https://openai.com/index/operator-system-card Source: OpenAI Title: Operator System Card Feedly Summary: Drawing from OpenAI’s established safety frameworks, this document highlights our multi-layered approach, including model and product mitigations we’ve implemented to protect against prompt engineering and jailbreaks, protect privacy and security, as well as details our external red teaming efforts, safety evaluations, and ongoing work…
-
The Register: Asus lets processor security fix slip out early, AMD confirms patch in progress
Source URL: https://www.theregister.com/2025/01/23/asus_amd_processor_fix/ Source: The Register Title: Asus lets processor security fix slip out early, AMD confirms patch in progress Feedly Summary: Answers on a postcard to what ‘Microcode Signature Verification Vulnerability’ might mean AMD has confirmed at least some of its microprocessors suffer a microcode-related security vulnerability, the existence of which accidentally emerged this…
-
Simon Willison’s Weblog: Trading Inference-Time Compute for Adversarial Robustness
Source URL: https://simonwillison.net/2025/Jan/22/trading-inference-time-compute/ Source: Simon Willison’s Weblog Title: Trading Inference-Time Compute for Adversarial Robustness Feedly Summary: Trading Inference-Time Compute for Adversarial Robustness Brand new research paper from OpenAI, exploring how inference-scaling “reasoning" models such as o1 might impact the search for improved security with respect to things like prompt injection. We conduct experiments on the…
-
Wired: This New AI Search Engine Has a Gimmick: Humans Answering Questions
Source URL: https://www.wired.com/story/this-new-ai-search-engine-has-a-gimmick-humans-answering-questions/ Source: Wired Title: This New AI Search Engine Has a Gimmick: Humans Answering Questions Feedly Summary: A new AI-powered search engine called Pearl is launching today, with an unusual pitch: It promises to connect you with an actual human expert if the AI answer sucks. WIRED gave it a spin. AI Summary…