Tag: teaming

Source URL: https://slashdot.org/story/25/01/29/2218253/microsoft-makes-deepseeks-r1-model-available-on-azure-ai-and-github?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Microsoft Makes DeepSeek’s R1 Model Available On Azure AI and GitHub Feedly Summary: AI Summary and Description: Yes Summary: Microsoft has enhanced its Azure AI Foundry platform by integrating DeepSeek’s R1 model, facilitating efficient experimentation and deployment of AI applications for developers. The model has passed extensive security evaluations,…

Hacker News: DeepSeek R1 Is Now Available on Azure AI Foundry and GitHub

Source URL: https://azure.microsoft.com/en-us/blog/deepseek-r1-is-now-available-on-azure-ai-foundry-and-github/ Source: Hacker News Title: DeepSeek R1 Is Now Available on Azure AI Foundry and GitHub Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the availability of DeepSeek R1 in the Azure AI Foundry model catalog, emphasizing the model’s integration into a trusted and scalable platform for businesses. It…

Simon Willison’s Weblog: How we estimate the risk from prompt injection attacks on AI systems

Source URL: https://simonwillison.net/2025/Jan/29/prompt-injection-attacks-on-ai-systems/ Source: Simon Willison’s Weblog Title: How we estimate the risk from prompt injection attacks on AI systems Feedly Summary: How we estimate the risk from prompt injection attacks on AI systems The “Agentic AI Security Team” at Google DeepMind share some details on how they are researching indirect prompt injection attacks. They…

Cloud Blog: Adversarial Misuse of Generative AI

Source URL: https://cloud.google.com/blog/topics/threat-intelligence/adversarial-misuse-generative-ai/ Source: Cloud Blog Title: Adversarial Misuse of Generative AI Feedly Summary: Rapid advancements in artificial intelligence (AI) are unlocking new possibilities for the way we work and accelerating innovation in science, technology, and beyond. In cybersecurity, AI is poised to transform digital defense, empowering defenders and enhancing our collective security. Large language…

Google Online Security Blog: How we estimate the risk from prompt injection attacks on AI systems

Source URL: https://security.googleblog.com/2025/01/how-we-estimate-risk-from-prompt.html Source: Google Online Security Blog Title: How we estimate the risk from prompt injection attacks on AI systems Feedly Summary: AI Summary and Description: Yes Summary: The text discusses emerging security challenges in modern AI systems, specifically focusing on a class of attacks called “indirect prompt injection.” It presents a comprehensive evaluation…

Simon Willison’s Weblog: Introducing Operator

Jan 23, 2025

Source URL: https://simonwillison.net/2025/Jan/23/introducing-operator/ Source: Simon Willison’s Weblog Title: Introducing Operator Feedly Summary: Introducing Operator OpenAI released their “research preview” today of Operator, a cloud-based browser automation platform rolling out today to $200/month ChatGPT Pro subscribers. They’re calling this their first “agent”. In the Operator announcement video Sam Altman defined that notoriously vague term like this:…

OpenAI: Operator System Card

Jan 23, 2025

Source URL: https://openai.com/index/operator-system-card Source: OpenAI Title: Operator System Card Feedly Summary: Drawing from OpenAI’s established safety frameworks, this document highlights our multi-layered approach, including model and product mitigations we’ve implemented to protect against prompt engineering and jailbreaks, protect privacy and security, as well as details our external red teaming efforts, safety evaluations, and ongoing work…

Simon Willison’s Weblog: Lessons From Red Teaming 100 Generative AI Products

Jan 18, 2025
