Tel:03-6256-8911

jp

数据解决方案

请输入姓名

携帯電話番号が無効です

連絡先を入力してください

会社名を入力してください

有効な仕事用電子メールを入力してください。

ご希望のデータについて入力してください

送信完了しました! ご協力ありがとうございました。

填写格式错误请重新填写

確認する

5文字以下、または数字のみでの入力は無効です。

https://www.datatang.co.jp

1714

_Data Products_Datatang

791 Hours - Mandarin Conversational Speech Data by Microphone_791 Hours - Mandarin Conversational Speech Data by Microphone

791 Hours - Mandarin Conversational Speech Data by Microphone

  • ライセンス認証を経た製品データセットが、AIプロジェクトのスピーディーな立ち上げをアシストします。

791 Hours - Mandarin Conversational Speech Data by Microphone, collected from dialogues based on given topics, covering dozens of generic domain. Transcribed with text content, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers(1,126 people in total), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

お問い合わせ サンプルを入手する

データ仕様

Format
48kHz, 16bit, uncompressed wav, mono channel;
Recording Environment
quiet indoor environment, without echo;
Recording content
dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;
Demographics
1126 speakers; balanced gender ratio among speakers, with age distribution ranging from 18 to 60 years old;
Annotation
extract and annotate individual sentences with their start and end timestamps, speaker identification, and spoken text content; noise annotation;
Device
Microphone;
Language
Mandarin;
Application scenarios
speech recognition; voiceprint recognition;
Accuracy rate
character accuracy rate of 99%

サンプル紹介

収集対象者からの明確に許可を得た、高品質の製品トレーニングデータセットはが、AIプロジェクトのスピーディーな立ち上げをアシストします。

さっそく始めてみる

関連データのおすすめ

600 Hours - Greek Real-world Casual Conversation and Monologue speech dataset
600 Hours - Greek Real-world Casual Conversation and Monologue speech dataset
600 Hours - Norwegian Real-world Casual Conversation and Monologue speech dataset
600 Hours - Norwegian Real-world Casual Conversation and Monologue speech dataset
Gujatati(India) Scripted dialogue speech dataset
Gujatati(India) Scripted dialogue speech dataset
Spanish(Mexico) Real-world Casual Conversation and Monologue speech dataset
Spanish(Mexico) Real-world Casual Conversation and Monologue speech dataset

Data Features

791 Hours - Mandarin Conversational Speech Data by Microphone

*Name:

*Phone:

*Company:

*E-mail:

*Requirement:

791 Hours - Mandarin Conversational Speech Data by Microphone