webgpu – Experimental News Clipping Site

Simon Willison’s Weblog: Load Llama-3.2 WebGPU in your browser from a local folder

Sep 8, 2025

—

by

Source URL: https://simonwillison.net/2025/Sep/8/webgpu-local-folder/#atom-everything Source: Simon Willison’s Weblog Title: Load Llama-3.2 WebGPU in your browser from a local folder Feedly Summary: Load Llama-3.2 WebGPU in your browser from a local folder Inspired by a comment on Hacker News I decided to see if it was possible to modify the transformers.js-examples/tree/main/llama-3.2-webgpu Llama 3.2 chat demo (online here,…

Hacker News: Show HN: In-Browser Graph RAG with Kuzu-WASM and WebLLM

Mar 10, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://blog.kuzudb.com/post/kuzu-wasm-rag/ Source: Hacker News Title: Show HN: In-Browser Graph RAG with Kuzu-WASM and WebLLM Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the launch of Kuzu’s WebAssembly (Wasm) version, showcasing its use in building an advanced in-browser chatbot leveraging graph retrieval techniques. Noteworthy is the emphasis on privacy and…

Bulletins: Vulnerability Summary for the Week of February 3, 2025

Feb 10, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.cisa.gov/news-events/bulletins/sb25-041 Source: Bulletins Title: Vulnerability Summary for the Week of February 3, 2025 Feedly Summary: High Vulnerabilities PrimaryVendor — Product Description Published CVSS Score Source Info .TUBE gTLD–.TUBE Video Curator Improper Neutralization of Input During Web Page Generation (‘Cross-site Scripting’) vulnerability in .TUBE gTLD .TUBE Video Curator allows Reflected XSS. This issue affects…

Hacker News: Show HN: BrowserAI – Run LLMs directly in browser using WebGPU (open source)

Jan 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://github.com/sauravpanda/BrowserAI Source: Hacker News Title: Show HN: BrowserAI – Run LLMs directly in browser using WebGPU (open source) Feedly Summary: Comments AI Summary and Description: Yes Summary: The text presents BrowserAI, a browser-based platform allowing users to run large language models (LLMs) directly within their browsers without needing complex server infrastructure. It emphasizes…

Simon Willison’s Weblog: Structured Generation w/ SmolLM2 running in browser & WebGPU

Nov 29, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Nov/29/structured-generation-smollm2-webgpu/#atom-everything Source: Simon Willison’s Weblog Title: Structured Generation w/ SmolLM2 running in browser & WebGPU Feedly Summary: Structured Generation w/ SmolLM2 running in browser & WebGPU Extraordinary demo by Vaibhav Srivastav. Here’s Hugging Face’s SmolLM2-1.7B-Instruct running directly in a web browser (using WebGPU, so requires Chrome for the moment) demonstrating structured text extraction,…

Simon Willison’s Weblog: llama-3.2-webgpu

Sep 30, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Sep/30/llama-32-webgpu/#atom-everything Source: Simon Willison’s Weblog Title: llama-3.2-webgpu Feedly Summary: llama-3.2-webgpu Llama 3.2 1B is a really interesting models, given its 128,000 token input and its tiny size (barely more than a GB). This page loads a 1.24GB q4f16 ONNX build of the Llama-3.2-1B-Instruct model and runs it with a React-powered chat interface directly…

Tag: webgpu

Simon Willison’s Weblog: Load Llama-3.2 WebGPU in your browser from a local folder

Hacker News: Show HN: In-Browser Graph RAG with Kuzu-WASM and WebLLM

Bulletins: Vulnerability Summary for the Week of February 3, 2025

Hacker News: Show HN: BrowserAI – Run LLMs directly in browser using WebGPU (open source)

Simon Willison’s Weblog: Structured Generation w/ SmolLM2 running in browser & WebGPU

Simon Willison’s Weblog: llama-3.2-webgpu