Tag: benchmark
-
Simon Willison’s Weblog: Qwen: Extending the Context Length to 1M Tokens
Source URL: https://simonwillison.net/2024/Nov/18/qwen-turbo/#atom-everything Source: Simon Willison’s Weblog Title: Qwen: Extending the Context Length to 1M Tokens Feedly Summary: Qwen: Extending the Context Length to 1M Tokens The new Qwen2.5-Turbo boasts a million token context window (up from 128,000 for Qwen 2.5) and faster performance: Using sparse attention mechanisms, we successfully reduced the time to first…
-
Simon Willison’s Weblog: Quoting Jack Clark
Source URL: https://simonwillison.net/2024/Nov/18/jack-clark/ Source: Simon Willison’s Weblog Title: Quoting Jack Clark Feedly Summary: The main innovation here is just using more data. Specifically, Qwen2.5 Coder is a continuation of an earlier Qwen 2.5 model. The original Qwen 2.5 model was trained on 18 trillion tokens spread across a variety of languages and tasks (e.g, writing,…
-
Hacker News: SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks
Source URL: https://arxiv.org/abs/2310.03684 Source: Hacker News Title: SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks Feedly Summary: Comments AI Summary and Description: Yes Summary: This text presents “SmoothLLM,” an innovative algorithm designed to enhance the security of Large Language Models (LLMs) against jailbreaking attacks, which manipulate models into producing undesirable content. The proposal highlights a…
-
Hacker News: Don’t Look Twice: Faster Video Transformers with Run-Length Tokenization
Source URL: https://rccchoudhury.github.io/rlt/ Source: Hacker News Title: Don’t Look Twice: Faster Video Transformers with Run-Length Tokenization Feedly Summary: Comments AI Summary and Description: Yes Summary: The text presents a novel approach called Run-Length Tokenization (RLT) aimed at optimizing video transformers by eliminating redundant tokens. This content-aware method results in substantial speed improvements for training and…
-
Cloud Blog: Dataproc Serverless: Now faster, easier and smarter
Source URL: https://cloud.google.com/blog/products/data-analytics/dataproc-serverless-performance-and-usability-updates/ Source: Cloud Blog Title: Dataproc Serverless: Now faster, easier and smarter Feedly Summary: We are thrilled to announce new capabilities that make running Dataproc Serverless even faster, easier, and more intelligent. Elevate your Spark experience with: Native query execution: Experience significant performance gains with the new Native query execution in the Premium…
-
Slashdot: AI Systems Solve Just 2% of Advanced Maths Problems in New Benchmark Test
Source URL: https://science.slashdot.org/story/24/11/13/1244216/ai-systems-solve-just-2-of-advanced-maths-problems-in-new-benchmark-test?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: AI Systems Solve Just 2% of Advanced Maths Problems in New Benchmark Test Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the limitations of leading AI systems in solving complex mathematics problems presented in a new benchmark called FrontierMath. Despite achieving high accuracy on traditional math…
-
The Register: California’s last nuclear plant turns to generative AI for filing and finding the fine print
Source URL: https://www.theregister.com/2024/11/13/nuclear_plant_generative_ai/ Source: The Register Title: California’s last nuclear plant turns to generative AI for filing and finding the fine print Feedly Summary: Diablo Canyon gets nifty new tech to … speed up document retrieval? A California startup is deploying what it says is the first commercial installation of generative AI at a US…