Qwen3-TTS
A tool to generate speech with voice cloning.
Qwen3-TTS is an AI-powered open-source text-to-speech model family that generates ultra-realistic, human-like audio with features like 3-second voice cloning, natural-language voice design, and fine-grained control over timbre, emotion, prosody, and speaking rate; it delivers low-latency streaming (~97 ms), supports 10 languages/9 dialects and 49 styles, comes in 0.6B (efficient) and 1.7B (high-performance) variants for long-form output, and is available via API, Python package, Hugging Face and GitHub under Apache‑2.0—making it ideal for creators, developers, and businesses needing customizable, high-fidelity AI TTS for narration, assistants, games, audiobooks, and real-time applications.
More in Text-To-Speech
Eleven Labs
Create natural sounding voices for creators and publishers
elevenlabs.ioText To Song
Uses AI to take your text and turn it into a song
voicemod.netVerbatik
A tool for multilingual text to voice generation.
verbatik.comDeepBrain AI
A tool to create text-to-speech videos.
deepbrain.ioResemble.ai
AI realistic text-to-speech voice generator - Can train your own voice
resemble.aiSuno
A tool to create music and speech.
suno.ai