speech datasets

jim-schwoebel/voice_datasets - GitHub github.com › jim-schwoebel › voice_datasets

A comprehensive list of open source voice and music datasets. I released this for the talk @ the VOICE Summit 2019.

Datasets - Speech Recognition - Papers With Code paperswithcode.com › datasets › mod=speech

MUSAN is a corpus of music, speech and noise. This dataset is suitable for training models for voice activity detection (VAD) and music/speech discrimination.

MLCommons People's Speech Dataset mlcommons.org › Datasets

The People's Speech Dataset contains 30,000 hours of conversational English speech for speech recognition, licensed for academic and commercial machine learning usage.

10 Open-source Speech Data Resources for Machine Learning waywithwords.net › resource › open-source-spe...

Feb 19, 2024 · Top Open-source Speech Data Resources for Machine Learning · #1 LibriSpeech · #2 Common Voice by Mozilla · #3 TED-LIUM · #4 VoxForge · #5 TIMIT ...

Open Speech and Language Resources - openslr.org www.openslr.org › resources

A human nonverbal vocal sound dataset by Deeply Inc. SLR100, Multilingual TEDx, Speech, a multilingual corpus of TEDx talks for speech recognition and ...

Datasets - Speech Recognition - Papers With Code paperswithcode.com › datasets › task=speech-re...

The VOICES corpus is a dataset to promote speech and signal processing research of speech recorded by far-field microphones in noisy room conditions.

The LJ Speech Dataset - Kaggle www.kaggle.com › datasets › mathurinache › th...

This is a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books.

RevoSpeechTech/speech-datasets-collection - GitHub github.com › RevoSpeechTech › speech-dataset...

Over 110 speech datasets are collected in this repository, and more than 70 datasets can be downloaded directly without further application or registration.

Mozilla Common Voice dataset commonvoice.mozilla.org › datasets

Text-to-speech datasets - Hugging Face Audio Course huggingface.co › learn › chapter6 › tts_datasets

Let's explore a few datasets suitable for TTS that you can find on the Hub. LJSpeech. LJSpeech is a dataset that consists of 13,100 English-language audio ...

Related searches

speech commands dataset

speech dataset kaggle

tts datasets

common voice dataset

video datasets

deepfake audio dataset

emotional speech dataset

audio datasets for machine learning