Application Aktualisiert 2026-04
Text-to-Speech
Definition
Text-to-Speech converts written text into spoken voice using AI, with increasingly realistic results.
Siehe auch im Glossar
G
Generative AI
Generative AI refers to artificial intelligence systems capable of creating original content: text, images, video, audio, code.
M
Multimodal
A multimodal model processes and generates multiple data types: text, images, audio and video.
N
NLP (Natural Language Processing)
NLP is the field of AI that enables machines to understand, interpret and generate human language.
S
Speech-to-Text
Speech-to-Text converts spoken words into written text, enabling automatic transcription of meetings, podcasts and calls.
Tools, die text-to-speech verwenden
Häufig gestellte Fragen
What's the best Text-to-Speech tool?
ElevenLabs for voice quality, Murf AI for professional voices in 120+ languages, Descript for complete audio editing.
Can you clone your voice?
Yes. ElevenLabs clones your voice with a few seconds of audio. Descript also offers voice cloning for fixing passages.