9,000 Images of 180 People - Driver Gesture 21 Landmarks Annotation Data
19 Hours Bus Scene Noise Data by Voice Recorder
10 million - English Test Questions Text Parsing And Processing Data
190 Hours - French(France) Gaming Real-world Casual Conversation and Monologue speech dataset
217 Hours - Spanish Financial Entities Real-world Casual Conversation and Monologue speech dataset
203 Hours - German(Germany) Financial Entities Real-world Casual Conversation and Monologue speech dataset
2 People - Korean Average Tone Speech Synthesis Corpus
105 Hours - Italian(Italy) Gaming Real-world Casual Conversation and Monologue speech dataset
206 Hours - English Financial Entities Real-world Casual Conversation and Monologue speech dataset
198 Hours - Spanish Gaming Real-world Casual Conversation and Monologue speech dataset
411 Hours - English Medical Scripted Dialogue speech dataset
203 Hours - Korean(Korea) Medical Entities Real-world Casual Conversation and Monologue speech dataset
215 Hours - Korean(Korea) Financial Entities Real-world Casual Conversation and Monologue speech dataset
4,484 People Multi-race - Infrared Face Recognition Data
500,605 Images - Individual Photo Face Data
4 People - Northeastern Dialect Average Tone Speech Synthesis Corpus
839 Hours - Romanian(Romania) Real-world Casual Conversation and Monologue speech dataset
373 Hours - Dari(Afghanistan) Spontaneous Dialogue Smartphone speech dataset
116,048 Sets - 3D Handpose Dataset
100,000 Instruction-Following Evaluation SFT for Chinese LLM Text Data
. . .