Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com

English

Speechocean HomepageHelp

Home > Commercial Resources > ASR-Corpus

Recording Platform

>

All

Language

>

All

Sort By:Default

307 Results

King-ASR-001

The United Arab Emirates(UAE) Arabic Speech Recognition Corpus was collected in United Arab Emirates(UAE).<br /> <br /> The script contains 124,672(approx.) utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.<br /> <br /> This corpus contains the voices of 104 different speakers (55 males, 49 females) who were balanced distributed in age (mainly 18 – 30,31 – 45,46 – 60), gender and regional accents. Each speaker was recorded in quiet office and home environment. <br /> <br /> Desktop platform, i.e. Windows OP was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in Pinyin. All manually checked. All audio files were manually transcribed and annotated by native transcribers.<br /> <br /> For more details, please check the technical document or ask our sales people.<br /> <br /> Contact Information:<br /> Phone: +86-10-62660053<br /> Email: contact@speechocean.com<br />

King-ASR-002

The Chinese Mandarin Telephone Speech Recognition Corpus was collected in China. It contains the voices of 285 different speakers (144 males, 141 females) who were balanced distributed in age (mainly 16 &ndash; 28, 29 &ndash; 45), gender and regional accents (for the details, please see the technical document). The script contains 14492 (approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet enviroments.Desktop platform, i.e. Windows 2000 was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-ASR-003

The Chinese Mandarin Mobile Speech Recognition Corpus was collected in China. It contains the voices of 265 different speakers (134 males, 131 females) who were balanced distributed in age (mainly 16 &ndash; 30, 31 &ndash; 45, 46+ ), gender and regional accents (for the details, please see the technical document). The script contains 7,606(approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet office environment. Mobile platform, i.e. IVR was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-ASR-004

The Chinese Mandarin Telephone Speech Recognition Corpus was collected in China. It contains the voices of 285 different speakers (144 males, 141 females) who were balanced distributed in age (mainly 16-28, 29-45 ), gender and regional accents (for the details, please see the technical document). The script contains 8109(approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. Desktop platform, i.e.Windows 2000 was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-ASR-005

The Chinese Mandarin Mobile Speech Recognition Corpus was collected in China. It contains the voices of 265 different speakers (134 males, 131 females) who were balanced distributed in age (mainly 16 &ndash; 28, 29 &ndash; 45 ), gender and regional accents (for the details, please see the technical document). The script contains 6972 (approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. Mobile platform, i.e. Windows 2000 was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-ASR-006

The Chinese Mandarin Telephone Speech Recognition Corpus was collected in China. It contains the voices of 285 different speakers (144 males, 141 females) who were balanced distributed in age (mainly), gender and regional accents (for the details, please see the technical document). The script contains 7239 (approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. Desktop platform, i.e.Windows 2000 was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification. <div>&nbsp;</div>

King-ASR-007

The Chinese Mandarin Mobile Speech Recognition Corpus was collected in China. It contains the voices of 64 different speakers (52 males, 12 females) who were balanced distributed in age (mainly 16-28, 29-45), gender and regional accents (for the details, please see the technical document). The script contains 3190(approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. Popular mobile platform, i.e. Windows XP was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-ASR-008

The Chinese Mandarin Telephone Speech Recognition Corpus was collected in China. It contains the voices of 86 different speakers (64 males, 22 females) who were balanced distributed in age (mainly 16-28, 29-45 ), gender and regional accents (for the details, please see the technical document). The script contains 4282(approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment.Desktop platform, i.e.Windows 2000 was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-ASR-009

The Chinese Mandarin Desktop Speech Recognition Corpus was collected in China. It contains the voices of 260 different speakers (87 males, 113 females) who were balanced distributed in age (mainly 18~29, 30~49, 50+), gender and regional accents (for the details, please see the technical document). The script contains 124595 (approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. Desktop platform, i.e. Windows XP was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-ASR-010

The Chinese Mandarin Desktop Speech Recognition Corpus was collected in China. It contains the voices of 200 different speakers (114 males, 86 females) who were balanced distributed in age (mainly 16 &ndash; 30,31 &ndash; 45,46 &ndash; 60), gender and regional accents (for the details, please see the technical document). The script contains 24000 (approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. Desktop platform, i.e. Windows XP was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

1 2 3 4 5 6 7 8 9 10