
40 People - Multi-level Control Multi-emotional Paralanguage Annotated Speech Synthesis Corpus

500 Hours - Japanese(Japan) 48khz Full-Duplex Spontaneous Dialogue Microphone speech dataset

100 hours - Dutch(Netherlands) Entities Scripted Monologue Smartphone speech dataset

100 hours - Thai(Thailand) Entities Scripted Monologue Smartphone speech dataset

100 hours - Japanese(Japan) Entities Scripted Monologue Smartphone speech dataset

100 hours - Italian(Italy) Entities Scripted Monologue Smartphone speech dataset

100 hours - German(Germany) Entities Scripted Monologue Smartphone speech dataset

100 hours - French(France) Entities Scripted Monologue Smartphone speech dataset

100 hours - English Entities Scripted Monologue Smartphone speech dataset

100 hours - Mandarin Chinese(China) Entities Scripted Monologue Smartphone speech dataset

100 hours - Arabic Entities Scripted Monologue Smartphone speech dataset

100 hours - Spanish(Spain) Entities Scripted Monologue Smartphone speech dataset

100 hours - Portuguese(European) Entities Scripted Monologue Smartphone speech dataset

601 Hours - Spanish(Argentina) Real-world Casual Conversation and Monologue speech dataset

200,000 Sets of Multi-country Landmark Buildings Image Caption Data

581 Hours - Greek Real-world Casual Conversation and Monologue speech dataset

600 Hours - Norwegian Real-world Casual Conversation and Monologue speech dataset

3D High-Fidelity Synthetic Data - DMS

1,000 Images – Japanese Invoices Collection Data

Japanese OKWAVE Q&A platform Text Parsing and Processing Data
. . .



