Tag: llms

  • Hacker News: I can now run a GPT-4 class model on my laptop

    Source URL: https://simonwillison.net/2024/Dec/9/llama-33-70b/ Source: Hacker News Title: I can now run a GPT-4 class model on my laptop Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the advances in consumer-grade hardware capable of running powerful Large Language Models (LLMs), specifically highlighting Meta’s Llama 3.3 model’s performance on a MacBook Pro M2.…

  • Slashdot: Microsoft AI Chief Says Conversational AI Will Replace Web Browsers

    Source URL: https://slashdot.org/story/24/12/09/1548200/microsoft-ai-chief-says-conversational-ai-will-replace-web-browsers?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Microsoft AI Chief Says Conversational AI Will Replace Web Browsers Feedly Summary: AI Summary and Description: Yes Summary: Mustafa Suleyman, CEO of Microsoft’s AI division, forecasts that conversational AI will replace traditional web interaction methods, revolutionizing how users engage with technology. He emphasizes the future significance of large language…

  • Simon Willison’s Weblog: I can now run a GPT-4 class model on my laptop

    Source URL: https://simonwillison.net/2024/Dec/9/llama-33-70b/ Source: Simon Willison’s Weblog Title: I can now run a GPT-4 class model on my laptop Feedly Summary: Meta’s new Llama 3.3 70B is a genuinely GPT-4 class Large Language Model that runs on my laptop. Just 20 months ago I was amazed to see something that felt GPT-3 class run on…

  • Hacker News: Task-Specific LLM Evals That Do and Don’t Work

    Source URL: https://eugeneyan.com/writing/evals/ Source: Hacker News Title: Task-Specific LLM Evals That Do and Don’t Work Feedly Summary: Comments AI Summary and Description: Yes Summary: The text presents a comprehensive overview of evaluation metrics for machine learning tasks, specifically focusing on classification, summarization, and translation within the context of large language models (LLMs). It highlights the…

  • The Register: Microsoft dangles $10K for hackers to hijack LLM email service

    Source URL: https://www.theregister.com/2024/12/09/microsoft_llm_prompt_injection_challenge/ Source: The Register Title: Microsoft dangles $10K for hackers to hijack LLM email service Feedly Summary: Outsmart an AI, win a little Christmas cash Microsoft and friends have challenged AI hackers to break a simulated LLM-integrated email client with a prompt injection attack – and the winning teams will share a $10,000…

  • Simon Willison’s Weblog: llm-openrouter 0.3

    Source URL: https://simonwillison.net/2024/Dec/8/llm-openrouter-03/#atom-everything Source: Simon Willison’s Weblog Title: llm-openrouter 0.3 Feedly Summary: llm-openrouter 0.3 New release of my llm-openrouter plugin, which allows LLM to access models hosted by OpenRouter. Quoting the release notes: Enable image attachments for models that support images. Thanks, Adam Montgomery. #12 Provide async model access. #15 Fix documentation to list correct…

  • Simon Willison’s Weblog: Quoting Ethan Mollick

    Source URL: https://simonwillison.net/2024/Dec/7/ethan-mollick/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Ethan Mollick Feedly Summary: A test of how seriously your firm is taking AI: when o-1 (& the new Gemini) came out this week, were there assigned folks who immediately ran the model through internal, validated, firm-specific benchmarks to see how useful it as? Did you…

  • Simon Willison’s Weblog: Meta AI release Llama 3.3

    Source URL: https://simonwillison.net/2024/Dec/6/llama-33/#atom-everything Source: Simon Willison’s Weblog Title: Meta AI release Llama 3.3 Feedly Summary: Meta AI release Llama 3.3 This new Llama-3.3-70B-Instruct model from Meta AI makes some bold claims: This model delivers similar performance to Llama 3.1 405B with cost effective inference that’s feasible to run locally on common developer workstations. I have…

  • Simon Willison’s Weblog: New Gemini model: gemini-exp-1206

    Source URL: https://simonwillison.net/2024/Dec/6/gemini-exp-1206/#atom-everything Source: Simon Willison’s Weblog Title: New Gemini model: gemini-exp-1206 Feedly Summary: New Gemini model: gemini-exp-1206 Google’s Jeff Dean: Today’s the one year anniversary of our first Gemini model releases! And it’s never looked better. Check out our newest release, Gemini-exp-1206, in Google AI Studio and the Gemini API! I upgraded my llm-gemini…

  • Embrace The Red: Terminal DiLLMa: LLM-powered Apps Can Hijack Your Terminal Via Prompt Injection

    Source URL: https://embracethered.com/blog/posts/2024/terminal-dillmas-prompt-injection-ansi-sequences/ Source: Embrace The Red Title: Terminal DiLLMa: LLM-powered Apps Can Hijack Your Terminal Via Prompt Injection Feedly Summary: Last week Leon Derczynski described how LLMs can output ANSI escape codes. These codes, also known as control characters, are interpreted by terminal emulators and modify behavior. This discovery resonates with areas I had…