A comprehensive list of open-source voice and music datasets. I released this for my talk at the VOICE Summit 2019.
MUSAN is a corpus of music, speech, and noise. This dataset is suitable for training models for voice activity detection (VAD) and music/speech discrimination.
The People's Speech Dataset contains 30,000 hours of conversational English speech for speech recognition, licensed for academic and commercial machine learning use.
Feb 19, 2024 · Top Open-source Speech Data Resources for Machine Learning · #1 LibriSpeech · #2 Common Voice by Mozilla · #3 TED-LIUM · #4 VoxForge · #5 TIMIT ...
A human nonverbal vocal sound dataset by Deeply Inc. SLR100: Multilingual TEDx (Speech), a multilingual corpus of TEDx talks for speech recognition and ...
The VOICES corpus is a dataset to promote speech and signal processing research on speech recorded by far-field microphones in noisy room conditions.
This is a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books.
This repository collects over 110 speech datasets, more than 70 of which can be downloaded directly without further application or registration.
Let's explore a few datasets suitable for TTS that you can find on the Hub. LJSpeech: LJSpeech is a dataset that consists of 13,100 English-language audio ...