Tag: speech processing
-
AWS News Blog: Introducing Amazon Nova Sonic: Human-like voice conversations for generative AI applications
Source URL: https://aws.amazon.com/blogs/aws/introducing-amazon-nova-sonic-human-like-voice-conversations-for-generative-ai-applications/
Source: AWS News Blog
Title: Introducing Amazon Nova Sonic: Human-like voice conversations for generative AI applications
Feedly Summary: Amazon Nova Sonic is a new foundation model on Amazon Bedrock that streamlines speech-enabled applications by offering unified speech recognition and generation capabilities, enabling natural conversations with contextual understanding while eliminating the need for…
-
Hacker News: Spark-TTS: Text-2-Speech Model Single-Stream Decoupled Tokens [pdf]
Source URL: https://arxiv.org/abs/2503.01710
Source: Hacker News
Title: Spark-TTS: Text-2-Speech Model Single-Stream Decoupled Tokens [pdf]
Feedly Summary: Comments
AI Summary and Description: Yes
Summary: The text discusses Spark-TTS, an innovative LLM-based text-to-speech model that contributes to advancements in zero-shot TTS synthesis. Its efficient design allows for customizable voice generation through a unique token representation and a…