Tag: Hugging Face transformers

  • Simon Willison’s Weblog: Trying out QvQ – Qwen’s new visual reasoning model

    Source URL: https://simonwillison.net/2024/Dec/24/qvq/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Trying out QvQ – Qwen’s new visual reasoning model
    Feedly Summary: I thought we were done for major model releases in 2024, but apparently not: Alibaba’s Qwen team just dropped the Apache 2 licensed QvQ-72B-Preview, “an experimental research model focusing on enhancing visual reasoning capabilities”. Their blog…

  • Simon Willison’s Weblog: Whisper large-v3-turbo model

    Source URL: https://simonwillison.net/2024/Oct/1/whisper-large-v3-turbo-model/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Whisper large-v3-turbo model
    Feedly Summary: It’s OpenAI DevDay today. Last year they released a whole stack of new features, including GPT-4 vision and GPTs and their text-to-speech API, so I’m intrigued to see what they release today (I’ll be at the San Francisco event).…