Large publicly available speech datasets
Updated Jun 30, 2021
A service designed to translate speech in multimedia using AI and ML voice cloning technology.
A place for my articles, research, etc.
Using Coqui TTS in a Kaggle notebook
Voice Cloner is a tool to clone human voices in a very natural and realistic way. The application collects voice samples and generates the audio using text to speech.
Dockerized VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
Voice conversion for Japanese
Problem statement: developing software for dubbing videos.
Making an AI voice from my speaking
An open-source Node.js package for the ElevenLabs Text to Speech API.
Transform Your Voice: Replicate Your Unique Sound in a Pristine Pre-Trained Model and Cultivate Your Custom Voiceprint
A research project and state-of-the-art review on text-to-speech models and voice cloning.
A simple Python script that outputs a separate audio file for each speaker in a YouTube video, using Whisper on Replicate.
The official Python API for the Revocalize AI voice synthesizer platform.
Voice cloning using Coqui TTS
Fully automated AI-generated fictional crime news stories in the style of a certain Estonian 90s TV show