speech-to-speech – Experimental News Clipping Site

Simon Willison’s Weblog: Introducing gpt-realtime

Sep 1, 2025

—

by

Source URL: https://simonwillison.net/2025/Sep/1/introducing-gpt-realtime/#atom-everything Source: Simon Willison’s Weblog Title: Introducing gpt-realtime Feedly Summary: Introducing gpt-realtime Released a few days ago (August 28th), gpt-realtime is OpenAI’s new “most advanced speech-to-speech model". It looks like this is a replacement for the older gpt-4o-realtime-preview model that was released last October. This is a slightly confusing release. The previous realtime…

Cloud Blog: Selecting the right Hyperdisk block storage for your workloads

Jun 11, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/storage-data-transfer/how-to-choose-the-right-hyperdisk-block-storage-for-your-use-case/ Source: Cloud Blog Title: Selecting the right Hyperdisk block storage for your workloads Feedly Summary: As you adopt Google Cloud or migrate to the latest Compute Engine VMs or to Google Kubernetes Engine (GKE), selecting the right block storage for your workload is crucial. Hyperdisk, Google Cloud’s workload-optimized block storage that’s designed…

The Register: ‘Savvy’ shortcuts produce near-instant speech-to-speech translation of 36 languages

Jan 15, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.theregister.com/2025/01/15/babel_fish_translations/ Source: The Register Title: ‘Savvy’ shortcuts produce near-instant speech-to-speech translation of 36 languages Feedly Summary: Babel Fish like ML model emerges after training on 4.5 million hours of multilingual spoken audio Meta has developed a machine learning model its researchers claim offers near-instant speech-to-speech translation between around 36 languages.… AI Summary and…

Simon Willison’s Weblog: First impressions of the new Amazon Nova LLMs (via a new llm-bedrock plugin)

Dec 4, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Dec/4/amazon-nova/ Source: Simon Willison’s Weblog Title: First impressions of the new Amazon Nova LLMs (via a new llm-bedrock plugin) Feedly Summary: Amazon released three new Large Language Models yesterday at their AWS re:Invent conference. The new model family is called Amazon Nova and comes in three sizes: Micro, Lite and Pro. I built…

Hacker News: Hugging Face tackles speech-to-speech

Sep 3, 2024

—

by

system automation

in Uncategorized

Source URL: https://github.com/huggingface/speech-to-speech Source: Hacker News Title: Hugging Face tackles speech-to-speech Feedly Summary: Comments AI Summary and Description: Yes Summary: The text describes an open-sourced, modular Speech-to-Speech pipeline utilizing various advanced AI models available on the Hugging Face Hub. This initiative provides significant potential for developers and researchers interested in integrating speech processing capabilities into…

Tag: speech-to-speech

Simon Willison’s Weblog: Introducing gpt-realtime

Cloud Blog: Selecting the right Hyperdisk block storage for your workloads

The Register: ‘Savvy’ shortcuts produce near-instant speech-to-speech translation of 36 languages

Simon Willison’s Weblog: First impressions of the new Amazon Nova LLMs (via a new llm-bedrock plugin)

Hacker News: Hugging Face tackles speech-to-speech