Core Highlights (TL;DR)

- Qwen3-TTS is a powerful open-source text-to-speech model supporting voice cloning, voice design, and multilingual generation across 10 languages
- 3-Second Voice Cloning: Using the Qwen3-TTS base model, clone any voice with just 3 seconds of audio input
- Industry-Leading Performance: Surpasses competitors like MiniMax, ElevenLabs, and SeedTTS in voice quality and speaker similarity
- Dual-Track Streaming Architecture: Achieves ultra-low 97ms latency through Qwen3-TTS, suitable for real-time applications
- Apache 2.0 License: Full...