Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com


King-ASR-044

Taiwanese Speech Recognition Corpus (Mobile)-Sentences-5232 Speakers

Corpus Details

S.N: King-ASR-044
Corpus Name: Taiwanese Speech Recognition Corpus (Mobile)-Sentences-5232 Speakers
Producer: Speechocean
IPR Ownership: Speechocean
Language/Accent: Taiwanese
Size: 234 GB
Recording Hours: 2,184 Hours
Recording Platform: Mobile
Parameters: 16 kHz, 16-bit, 1 channel
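
As a rough illustration of the audio parameters listed above, the sketch below checks whether a single file is 16 kHz, 16-bit, single-channel audio. It assumes the files are standard RIFF/WAV files with PCM data that Python's built-in wave module can read; the listing does not confirm the exact container layout, and the function name matches_corpus_format is made up for this example.

```python
import wave

# Expected format, taken from the "Parameters" entry of the listing.
EXPECTED_RATE_HZ = 16_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bit
EXPECTED_CHANNELS = 1


def matches_corpus_format(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono.

    Assumption: the file is a plain PCM WAV file. wave.open raises
    wave.Error for files it cannot parse.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )
```

Calling matches_corpus_format on each delivered file is one simple way to spot files that were resampled or converted after delivery.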

Introduction

The Taiwanese Speech Recognition Corpus was collected in Taiwan.

The script contains approximately 1,643,451 utterances in total, designed to provide material for both training and testing speech recognizers. Each utterance is stored uncompressed in a separate wave file.
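
Because the audio is stored uncompressed, the listed size can be cross-checked against the recording hours and audio parameters. The sketch below is only a back-of-the-envelope estimate: it ignores file headers, transcripts and the lexicon, and it treats the 2,184-hour figure as exact.

```python
# Figures taken from the listing.
HOURS = 2_184
SAMPLE_RATE_HZ = 16_000
BYTES_PER_SAMPLE = 2  # 16 bit
CHANNELS = 1
UTTERANCES = 1_643_451  # approximate, per the listing

total_seconds = HOURS * 3600
raw_audio_bytes = total_seconds * SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS

# Raw audio size in binary gigabytes (GiB), and mean utterance duration.
size_gib = raw_audio_bytes / 2**30
avg_utterance_seconds = total_seconds / UTTERANCES

print(f"raw audio: {size_gib:.1f} GiB")
print(f"mean utterance length: {avg_utterance_seconds:.2f} s")
```

The estimate comes out at roughly 234 GiB of raw audio, which is consistent with the listed size of 234 GB if that figure uses binary units, and at an average utterance length of a little under 5 seconds.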

This corpus contains the voices of 5,232 different speakers (2,365 males and 2,867 females), balanced across age group (16–30, 31–45, and over 45), gender, and regional accent. Each speaker was recorded in either a quiet or a noisy environment.

Speech was collected on a mobile platform (Android). A pronunciation lexicon with phonemic transcriptions in zh-tw_pinyin is available and has been manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please consult the technical document or contact our sales team.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com
