Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com


300 Results

King-ASR-001

The Chinese Mandarin Mobile Speech Recognition Corpus was collected in China. It contains the voices of 265 different speakers (134 males, 131 females), balanced in age (mainly 16 – 28, 29 – 45), gender and regional accent (for details, see the technical document). The script contains approximately 13,942 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. A mobile platform (i.e., Windows 2000) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.

King-ASR-002

The Chinese Mandarin Telephone Speech Recognition Corpus was collected in China. It contains the voices of 285 different speakers (144 males, 141 females), balanced in age (mainly 16 – 28, 29 – 45), gender and regional accent (for details, see the technical document). The script contains approximately 14492 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. A desktop platform (i.e., Windows 2000) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.

King-ASR-003

The Chinese Mandarin Mobile Speech Recognition Corpus was collected in China. It contains the voices of 265 different speakers (134 males, 131 females), balanced in age (mainly 16 – 30, 31 – 45, 46+), gender and regional accent (for details, see the technical document). The script contains approximately 7,606 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet office environment. A mobile platform (i.e., IVR) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.

King-ASR-004

The Chinese Mandarin Telephone Speech Recognition Corpus was collected in China. It contains the voices of 285 different speakers (144 males, 141 females), balanced in age (mainly 16-28, 29-45), gender and regional accent (for details, see the technical document). The script contains approximately 8109 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. A desktop platform (i.e., Windows 2000) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.

King-ASR-005

The Chinese Mandarin Mobile Speech Recognition Corpus was collected in China. It contains the voices of 265 different speakers (134 males, 131 females), balanced in age (mainly 16 – 28, 29 – 45), gender and regional accent (for details, see the technical document). The script contains approximately 6972 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. A mobile platform (i.e., Windows 2000) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.

King-ASR-006

The Chinese Mandarin Telephone Speech Recognition Corpus was collected in China. It contains the voices of 285 different speakers (144 males, 141 females), balanced in age, gender and regional accent (for details, see the technical document). The script contains approximately 7239 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. A desktop platform (i.e., Windows 2000) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.

King-ASR-007

The Chinese Mandarin Mobile Speech Recognition Corpus was collected in China. It contains the voices of 64 different speakers (52 males, 12 females), balanced in age (mainly 16-28, 29-45), gender and regional accent (for details, see the technical document). The script contains approximately 3190 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. A popular mobile platform (i.e., Windows XP) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.

King-ASR-008

The Chinese Mandarin Telephone Speech Recognition Corpus was collected in China. It contains the voices of 86 different speakers (64 males, 22 females), balanced in age (mainly 16-28, 29-45), gender and regional accent (for details, see the technical document). The script contains approximately 4282 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. A desktop platform (i.e., Windows 2000) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.

King-ASR-009

The Chinese Mandarin Desktop Speech Recognition Corpus was collected in China. It contains the voices of 260 different speakers (87 males, 113 females), balanced in age (mainly 18~29, 30~49, 50+), gender and regional accent (for details, see the technical document). The script contains approximately 124595 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. A desktop platform (i.e., Windows XP) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.

King-ASR-010

The Chinese Mandarin Desktop Speech Recognition Corpus was collected in China. It contains the voices of 200 different speakers (114 males, 86 females), balanced in age (mainly 16 – 30, 31 – 45, 46 – 60), gender and regional accent (for details, see the technical document). The script contains approximately 24000 utterances in total (for details of the script structure design, see the specification) and was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. A desktop platform (i.e., Windows XP) was used for speech collection. Each utterance was stored uncompressed as a waveform in a separate file. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available in the specification.
