Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Reset Tasks
Multimodal
Visual Question Answering
Video-Text-to-Text
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Table to Text
Multiple Choice
Text Ranking
Text Retrieval
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Tabular to Text
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Datasets
959
Full-text search
Edit filters
Sort: Trending
Active filters:
automatic-speech-recognition
Clear all
openslr/librispeech_asr
Updated
Aug 14, 2024
•
14.4k
•
151
facebook/voxpopuli
Viewer
•
Updated
Oct 14, 2022
•
169k
•
6.92k
•
118
mozilla-foundation/common_voice_11_0
Viewer
•
Updated
Jun 26, 2023
•
6.37M
•
87.2k
•
236
joujiboi/japanese-anime-speech-v2
Viewer
•
Updated
Dec 18, 2024
•
293k
•
1.32k
•
90
future-technologies/Universal-Transformers-Dataset
Preview
•
Updated
17 days ago
•
8.06k
•
105
MaratDV/video-dataset
Viewer
•
Updated
14 days ago
•
6
•
33
•
2
speechcolab/gigaspeech
Viewer
•
Updated
Nov 23, 2023
•
364k
•
13.1k
•
108
hanamizuki-ai/genshin-voice-v3.3-mandarin
Viewer
•
Updated
Dec 31, 2022
•
11.3k
•
1k
•
32
medkit/simsamu
Viewer
•
Updated
Jan 6
•
61
•
191
•
5
FBK-MT/Speech-MASSIVE
Viewer
•
Updated
Aug 8, 2024
•
97.6k
•
835
•
39
amphion/Emilia-Dataset
Viewer
•
Updated
Feb 28
•
54.8M
•
47.6k
•
308
HKUSTAudio/Audio-FLAN-Dataset
Preview
•
Updated
15 days ago
•
3.42k
•
34
BAAI/ChildMandarin
Viewer
•
Updated
Mar 20
•
40.9k
•
108
•
11
facebook/multilingual_librispeech
Viewer
•
Updated
Aug 12, 2024
•
1.49M
•
5.24k
•
134
PolyAI/minds14
Updated
Sep 10, 2024
•
2.99k
•
82
google/fleurs
Updated
Aug 25, 2024
•
27.1k
•
289
MLCommons/peoples_speech
Viewer
•
Updated
Nov 20, 2024
•
8.05M
•
18.5k
•
114
edinburghcstr/ami
Viewer
•
Updated
Jan 16, 2023
•
110k
•
1.79k
•
50
Fhrozen/CABankSakuraCHJP
Preview
•
Updated
Dec 3, 2022
•
93
•
1
language-and-voice-lab/samromur_children
Viewer
•
Updated
Oct 15, 2023
•
86.5k
•
77
•
7
mozilla-foundation/common_voice_12_0
Viewer
•
Updated
Nov 17, 2023
•
6.83M
•
1.64k
•
30
linhtran92/viet_bud500
Viewer
•
Updated
Feb 29, 2024
•
649k
•
585
•
53
cdminix/libritts-r-aligned
Updated
Apr 26, 2024
•
163
•
17
huckiyang/DiPCo
Preview
•
Updated
Feb 6, 2024
•
49
•
5
ProgramComputer/voxceleb
Updated
Jul 27, 2024
•
3.5k
•
79
imvladikon/hebrew_speech_campus
Viewer
•
Updated
Nov 20, 2023
•
75.9k
•
405
•
6
mozilla-foundation/common_voice_15_0
Viewer
•
Updated
Dec 7, 2023
•
7.57M
•
2.33k
•
13
pelcra/pl-asr-pelcra-for-bigos
Updated
Oct 26, 2024
•
35
•
3
doof-ferb/vlsp2020_vinai_100h
Viewer
•
Updated
26 days ago
•
56.4k
•
1.16k
•
10
doof-ferb/fpt_fosd
Viewer
•
Updated
Feb 10, 2024
•
25.9k
•
317
•
1
Previous
1
2
3
...
32
Next