1
9,000 Images of 180 People - Driver Gesture 21 Landmarks Annotation Data
19 Hours Bus Scene Noise Data by Voice Recorder
10 million - English Test Questions Text Parsing And Processing Data
190 Hours - French(France) Gaming Real-world Casual Conversation and Monologue speech dataset
217 Hours - Spanish Financial Entities Real-world Casual Conversation and Monologue speech dataset
200 Hours - Portuguese(Brazil) Financial Entities Real-world Casual Conversation and Monologue speech dataset
203 Hours - German(Germany) Financial Entities Real-world Casual Conversation and Monologue speech dataset
2 People - Korean Average Tone Speech Synthesis Corpus
14 Hours - Taiwan Mandarin Seven Style Average Tone Speech Synthesis Corpus
105 Hours - Italian(Italy) Gaming Real-world Casual Conversation and Monologue speech dataset
300 Hours - English(India) Spontaneous Dialogue Smartphone speech dataset
206 Hours - English Financial Entities Real-world Casual Conversation and Monologue speech dataset
198 Hours - Spanish Gaming Real-world Casual Conversation and Monologue speech dataset
411 Hours - English Medical Scripted Dialogue speech dataset
203 Hours - Korean(Korea) Medical Entities Real-world Casual Conversation and Monologue speech dataset
215 Hours - Korean(Korea) Financial Entities Real-world Casual Conversation and Monologue speech dataset
4,484 People Multi-race – Infrared Face Recognition Data
500,605 Images - Individual Photo Face Data
839 Hours - Romanian(Romania) Real-world Casual Conversation and Monologue speech dataset
. . .