Tag: ai model

  • OpenAI : Trading inference-time compute for adversarial robustness

    Source URL: https://openai.com/index/trading-inference-time-compute-for-adversarial-robustness
    Source: OpenAI
    Title: Trading inference-time compute for adversarial robustness
    Feedly Summary: Trading Inference-Time Compute for Adversarial Robustness
    AI Summary and Description: Yes
    Summary: The text explores the trade-offs between inference-time computing demands and adversarial robustness within AI systems, particularly relevant in the context of machine learning and AI security. This topic holds…

  • Simon Willison’s Weblog: r1.py script to run R1 with a min-thinking-tokens parameter

    Source URL: https://simonwillison.net/2025/Jan/22/r1py/
    Source: Simon Willison’s Weblog
    Title: r1.py script to run R1 with a min-thinking-tokens parameter
    Feedly Summary: r1.py script to run R1 with a min-thinking-tokens parameter. Fantastically creative hack by Theia Vogel. The DeepSeek R1 family of models output their chain of thought inside a <think>…</think> block. Theia found that you can intercept…

  • Scott Logic: The UK’s AI Opportunities Action Plan – somewhat quiet on risks

    Source URL: https://blog.scottlogic.com/2025/01/22/the-uks-ai-opportunities-action-plan-somewhat-quiet-on-risks.html
    Source: Scott Logic
    Title: The UK’s AI Opportunities Action Plan – somewhat quiet on risks
    Feedly Summary: Last week the UK government launched their 50-point AI Opportunities Action Plan. The plan is ambitious, but it is something of a mixed bag. Some sizeable and worthwhile investments, alongside others which are quite questionable.…

  • The Register: Google DeepMind CEO says 2025’s the year we start popping pills AI helped invent

    Source URL: https://www.theregister.com/2025/01/22/google_deepmind_ai_drugs/
    Source: The Register
    Title: Google DeepMind CEO says 2025’s the year we start popping pills AI helped invent
    Feedly Summary: Nobel Prize winner Demis Hassabis thinks human trials will happen soon. Clinical trials of the first drugs designed with the help of artificial intelligence could commence this year, Google DeepMind CEO Demis…

  • Simon Willison’s Weblog: Run DeepSeek R1 or V3 with MLX Distributed

    Source URL: https://simonwillison.net/2025/Jan/22/mlx-distributed/
    Source: Simon Willison’s Weblog
    Title: Run DeepSeek R1 or V3 with MLX Distributed
    Feedly Summary: Run DeepSeek R1 or V3 with MLX Distributed. Handy detailed instructions from Awni Hannun on running the enormous DeepSeek R1 or v3 models on a cluster of Macs using the distributed communication feature of Apple’s MLX library.…

  • The Register: OpenAI and pals form biz to spend $500B on its own AI infrastructure

    Source URL: https://www.theregister.com/2025/01/22/openai_stargate_ai_datacenter_company/
    Source: The Register
    Title: OpenAI and pals form biz to spend $500B on its own AI infrastructure
    Feedly Summary: With help from SoftBank, Oracle, and MGX, Stargate Project plans to outspend Microsoft this year. OpenAI, Oracle, SoftBank, and investment firm MGX on Tuesday announced plans to spend $500 billion on AI infrastructure…

  • Slashdot: Managing AI Agents As Employees Is the Challenge of 2025, Says Goldman Sachs CIO

    Source URL: https://it.slashdot.org/story/25/01/21/2213230/managing-ai-agents-as-employees-is-the-challenge-of-2025-says-goldman-sachs-cio
    Source: Slashdot
    Title: Managing AI Agents As Employees Is the Challenge of 2025, Says Goldman Sachs CIO
    Feedly Summary:
    AI Summary and Description: Yes
    Summary: The text discusses predictions from Goldman Sachs regarding the evolution of artificial intelligence (AI) in corporate environments, particularly focusing on the integration of AI as active participants…

  • Slashdot: Cutting-Edge Chinese ‘Reasoning’ Model Rivals OpenAI O1

    Source URL: https://slashdot.org/story/25/01/21/2138247/cutting-edge-chinese-reasoning-model-rivals-openai-o1?utm_source=rss1.0mainlinkanon&utm_medium=feed
    Source: Slashdot
    Title: Cutting-Edge Chinese ‘Reasoning’ Model Rivals OpenAI O1
    Feedly Summary:
    AI Summary and Description: Yes
    Summary: The release of DeepSeek’s R1 model family marks a significant advancement in the availability of high-performing AI models, particularly in the realms of math and coding tasks. With an open MIT license, these models…

  • Hacker News: LLMs Demonstrate Behavioral Self-Awareness [pdf]

    Source URL: https://martins1612.github.io/selfaware_paper_betley.pdf
    Source: Hacker News
    Title: LLMs Demonstrate Behavioral Self-Awareness [pdf]
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The provided text discusses a study focused on the concept of behavioral self-awareness in Large Language Models (LLMs). The research demonstrates that LLMs can be finetuned to recognize and articulate their learned behaviors, including…

  • Simon Willison’s Weblog: AI mistakes are very different from human mistakes

    Source URL: https://simonwillison.net/2025/Jan/21/ai-mistakes-are-very-different-from-human-mistakes/#atom-everything
    Source: Simon Willison’s Weblog
    Title: AI mistakes are very different from human mistakes
    Feedly Summary: AI mistakes are very different from human mistakes. An entertaining and informative read by Bruce Schneier and Nathan E. Sanders. If you want to use an AI model to help with a business problem, it’s not enough…