Tag: Ethics

  • Simon Willison’s Weblog: Quoting Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

    Source URL: https://simonwillison.net/2025/Feb/25/emergent-misalignment/ Source: Simon Willison’s Weblog Title: Quoting Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs Feedly Summary: In our experiment, a model is finetuned to output insecure code without disclosing this to the user. The resulting model acts misaligned on a broad range of prompts that are unrelated to coding: it asserts…

  • Simon Willison’s Weblog: Deep research System Card

    Source URL: https://simonwillison.net/2025/Feb/25/deep-research-system-card/#atom-everything Source: Simon Willison’s Weblog Title: Deep research System Card Feedly Summary: Deep research System Card OpenAI are rolling out their Deep research “agentic" research tool to their $20/month ChatGPT Plus users today, who get 10 queries a month. $200/month ChatGPT Pro gets 120 uses. Deep research is the best version of this…

  • Hacker News: DOGE will use AI to assess the responses of federal workers

    Source URL: https://www.nbcnews.com/politics/doge/federal-workers-agencies-push-back-elon-musks-email-ultimatum-rcna193439 Source: Hacker News Title: DOGE will use AI to assess the responses of federal workers Feedly Summary: Comments AI Summary and Description: Yes Summary: The provided text discusses a controversial email sent by the U.S. Office of Personnel Management, orchestrated by Elon Musk, directing federal employees to report their weekly accomplishments. The…

  • Schneier on Security: More Research Showing AI Breaking the Rules

    Source URL: https://www.schneier.com/blog/archives/2025/02/more-research-showing-ai-breaking-the-rules.html Source: Schneier on Security Title: More Research Showing AI Breaking the Rules Feedly Summary: These researchers had LLMs play chess against better opponents. When they couldn’t win, they sometimes resorted to cheating. Researchers gave the models a seemingly impossible task: to win against Stockfish, which is one of the strongest chess engines…

  • Slashdot: Mozilla Wans to Expand from Firefox to Open-Source AI and Privacy-Respecting Ads

    Source URL: https://tech.slashdot.org/story/25/02/23/067249/mozilla-wans-to-expand-from-firefox-to-open-source-ai-and-privacy-respecting-ads?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Mozilla Wans to Expand from Firefox to Open-Source AI and Privacy-Respecting Ads Feedly Summary: AI Summary and Description: Yes Summary: The text discusses Mozilla’s strategic direction under President Mark Surman, focusing on enhancing its Firefox browser while integrating generative AI features to maintain relevance in an evolving tech landscape.…

  • Simon Willison’s Weblog: Quoting Joanna Bryson

    Source URL: https://simonwillison.net/2025/Feb/20/joanna-bryson/ Source: Simon Willison’s Weblog Title: Quoting Joanna Bryson Feedly Summary: There are contexts in which it is immoral to use generative AI. For example, if you are a judge responsible for grounding a decision in law, you cannot rest that on an approximation of previous cases unknown to you. You want an…

  • Wired: I’m Not Convinced Ethical Generative AI Currently Exists

    Source URL: https://www.wired.com/story/the-prompt-ethical-generative-ai-does-not-exist/ Source: Wired Title: I’m Not Convinced Ethical Generative AI Currently Exists Feedly Summary: WIRED’s advice columnist considers whether some AI tools are more ethical than others, and if developers can make AI wiser. AI Summary and Description: Yes Summary: The text discusses the ethical implications surrounding generative AI tools, focusing on the…

  • The Register: Microsoft researchers promise entire game worlds made from AI slop

    Source URL: https://www.theregister.com/2025/02/19/microsoft_genai_game_dev_model/ Source: The Register Title: Microsoft researchers promise entire game worlds made from AI slop Feedly Summary: WHAM, bam, no thank you, ma’am? Researchers have produced a generative AI tool they say can create a three-dimensional game world to help developers design and tweak gameplay.… AI Summary and Description: Yes Summary: Researchers from…