Tag: web

  • Simon Willison’s Weblog: DeepSeek_V3.pdf

    Source URL: https://simonwillison.net/2024/Dec/26/deepseek-v3/#atom-everything Source: Simon Willison’s Weblog Title: DeepSeek_V3.pdf Feedly Summary: DeepSeek_V3.pdf The DeepSeek v3 paper (and model card) are out, after yesterday’s mysterious release of the undocumented model weights. Plenty of interesting details in here. The model pre-trained on 14.8 trillion “high-quality and diverse tokens" (not otherwise documented). Following this, we conduct post-training, including…

  • Simon Willison’s Weblog: Quoting EU Artificial Intelligence Act

    Source URL: https://simonwillison.net/2024/Dec/26/eu-artificial-intelligence-act/#atom-everything Source: Simon Willison’s Weblog Title: Quoting EU Artificial Intelligence Act Feedly Summary: Providers and deployers of AI systems shall take measures to ensure, to their best extent, a sufficient level of AI literacy of their staff and other persons dealing with the operation and use of AI systems on their behalf, taking…

  • Simon Willison’s Weblog: deepseek-ai/DeepSeek-V3-Base

    Source URL: https://simonwillison.net/2024/Dec/25/deepseek-v3/#atom-everything Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-V3-Base Feedly Summary: deepseek-ai/DeepSeek-V3-Base No model card or announcement yet, but this new model release from Chinese AI lab DeepSeek (an arm of Chinese hedge fund High-Flyer) looks very significant. It’s a huge model – 685B parameters, 687.9 GB on disk (TIL how to size a git-lfs…

  • Simon Willison’s Weblog: Trying out QvQ – Qwen’s new visual reasoning model

    Source URL: https://simonwillison.net/2024/Dec/24/qvq/#atom-everything Source: Simon Willison’s Weblog Title: Trying out QvQ – Qwen’s new visual reasoning model Feedly Summary: I thought we were done for major model releases in 2024, but apparently not: Alibaba’s Qwen team just dropped the Apache2 2 licensed QvQ-72B-Preview, “an experimental research model focusing on enhancing visual reasoning capabilities". Their blog…

  • Simon Willison’s Weblog: Quoting Paige Bailey

    Source URL: https://simonwillison.net/2024/Dec/24/paige-bailey/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Paige Bailey Feedly Summary: it’s really hard not to be obsessed with these tools. It’s like having a bespoke, free, (usually) accurate curiosity-satisfier in your pocket, no matter where you go – if you know how to ask questions, then suddenly the world is an audiobook…

  • The Register: How Androxgh0st rose from Mozi’s ashes to become ‘most prevalent malware’

    Source URL: https://www.theregister.com/2024/12/24/androxgh0st_botnet_mozi/ Source: The Register Title: How Androxgh0st rose from Mozi’s ashes to become ‘most prevalent malware’ Feedly Summary: Botnet’s operators ‘driven by similar interests as that of the Chinese state’ After the Mozi botnet mysteriously disappeared last year, a new and seemingly more powerful botnet, Androxgh0st, rose from its ashes and has quickly…

  • Simon Willison’s Weblog: Finally, a Replacement for BERT: Introducing ModernBERT

    Source URL: https://simonwillison.net/2024/Dec/24/modernbert/ Source: Simon Willison’s Weblog Title: Finally, a Replacement for BERT: Introducing ModernBERT Feedly Summary: Finally, a Replacement for BERT: Introducing ModernBERT BERT was an early language model released by Google in October 2018. Unlike modern LLMs it wasn’t designed for generating text. BERT was trained for masked token prediction and was generally…

  • MCP Server Cloud – The Model Context Protocol Server Directory: MCP Azure OpenAI Server – MCP Server Integration

    Source URL: https://mcpserver.cloud/server/mcp-azure-openai-server Source: MCP Server Cloud – The Model Context Protocol Server Directory Title: MCP Azure OpenAI Server – MCP Server Integration Feedly Summary: AI Summary and Description: Yes **Summary:** The text discusses the implementation of the Model Context Protocol (MCP) for integrating AI applications, particularly with Azure OpenAI. It highlights the architecture, configuration…

  • Hacker News: Co-Adapting Human Interfaces and LMs

    Source URL: https://jessylin.com/2024/11/12/co-adapting-human-interfaces/ Source: Hacker News Title: Co-Adapting Human Interfaces and LMs Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the adaptive relationship between language models (LMs) and the digital environments they interact with, highlighting a shift in how systems are designed to cater to LMs as users. It emphasizes both…