
Whisper (OpenAI)
Translate audio or video to text with language translation
Whisper is an open-source automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. It is designed to be robust to accents, background noise and technical language, and can transcribe and translate speech in multiple languages into English. It is a simple end-to-end approach, implemented as an encoder-decoder Transformer. It is also capable of performing language identification and phrase-level timestamps. It is designed to be easy to use and have high accuracy, allowing developers to add voice interfaces to more applications.
More in Speech-To-Text
Easy-Peasy.AI
AI generated text, images, and transcriptions
easy-peasy.aiVideoDubber
A tool to translate and clone voices in videos.
videodubber.aiType Studio
All-in-one editing tool with transcription, video editing, and repurposing
typestudio.coTranslate.Video
Translate videos with just 1-Click
translate.videoYaps
A tool to transcribe and read text offline.
yaps.aiThing Translator
Take a picture and Google's AI will tell you what it is
experiments.withgoogle.com