Tag: generative

  • Simon Willison’s Weblog: Quoting Ai2

    Source URL: https://simonwillison.net/2025/Mar/13/ai2/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Ai2 Feedly Summary: Today we release OLMo 2 32B, the most capable and largest model in the OLMo 2 family, scaling up the OLMo 2 training recipe used for our 7B and 13B models released in November. It is trained up to 6T tokens and post-trained…

  • Simon Willison’s Weblog: Anthropic API: Text editor tool

    Source URL: https://simonwillison.net/2025/Mar/13/anthropic-api-text-editor-tool/ Source: Simon Willison’s Weblog Title: Anthropic API: Text editor tool Feedly Summary: Anthropic API: Text editor tool Anthropic released a new “tool" today for text editing. It looks similar to the tool they offered as part of their computer use beta API, and the trick they’ve been using for a while in…

  • Simon Willison’s Weblog: Introducing Command A: Max performance, minimal compute

    Source URL: https://simonwillison.net/2025/Mar/13/command-a/#atom-everything Source: Simon Willison’s Weblog Title: Introducing Command A: Max performance, minimal compute Feedly Summary: Introducing Command A: Max performance, minimal compute New LLM release from Cohere. It’s interesting to see which aspects of the model they’re highlighting, as an indicator of what their commercial customers value the most (highlight mine): Command A…

  • The Register: OpenAI asks Uncle Sam to let it scrape everything, stop other countries complaining

    Source URL: https://www.theregister.com/2025/03/13/openai_data_copyright/ Source: The Register Title: OpenAI asks Uncle Sam to let it scrape everything, stop other countries complaining Feedly Summary: The rest of the world doesn’t think ‘fair use’ is fair but we should make ’em OpenAI wants the US government to ensure it has access to any data it wants to train…

  • Scott Logic: Beyond the Hype: Is architecture for AI even necessary?

    Source URL: https://blog.scottlogic.com/2025/03/13/beyond-the-hype-is-architecture-for-ai-even-necessary.html Source: Scott Logic Title: Beyond the Hype: Is architecture for AI even necessary? Feedly Summary: In this episode, I’m joined by colleagues Jess McEvoy and James Heward, and Atom Bank’s Head of AI and Data Science, Russell Collingham, to explore the transformative impact of generative AI and the critical role of architecture…

  • Slashdot: Google Claims Gemma 3 Reaches 98% of DeepSeek’s Accuracy Using Only One GPU

    Source URL: https://news.slashdot.org/story/25/03/13/0010231/google-claims-gemma-3-reaches-98-of-deepseeks-accuracy-using-only-one-gpu?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Google Claims Gemma 3 Reaches 98% of DeepSeek’s Accuracy Using Only One GPU Feedly Summary: AI Summary and Description: Yes Summary: Google’s new open-source AI model, Gemma 3, boasts impressive performance comparable to DeepSeek AI’s R1 while utilizing significantly fewer resources. This advancement highlights key innovations in AI model…

  • OpenAI : Driving growth and ‘WOW’ moments with OpenAI

    Source URL: https://openai.com/index/ly-corporation Source: OpenAI Title: Driving growth and ‘WOW’ moments with OpenAI Feedly Summary: LY Corporation: Driving growth and ‘WOW’ moments with OpenAI AI Summary and Description: Yes Summary: The text highlights LY Corporation’s engagement with OpenAI, emphasizing the company’s focus on leveraging artificial intelligence to enhance growth and create impactful experiences. This is…

  • Simon Willison’s Weblog: Notes on Google’s Gemma 3

    Source URL: https://simonwillison.net/2025/Mar/12/gemma-3/ Source: Simon Willison’s Weblog Title: Notes on Google’s Gemma 3 Feedly Summary: Google’s Gemma team released an impressive new model today (under their not-open-source Gemma license). Gemma 3 comes in four sizes – 1B, 4B, 12B, and 27B – and while 1B is text-only the larger three models are all multi-modal for…