Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com

Speechocean: AI Data Resource Service Platform

4 Results

King-ASR-126

The Chinese English Speech Recognition Corpus was collected in China.

The script contains approximately 30,076 utterances in total, designed to provide material for both training and testing speech recognizers. Each utterance is stored as an uncompressed waveform in a separate file.

The corpus contains the voices of 100 speakers (48 male, 52 female), balanced across age (18–26), gender, and regional accent. Each speaker was recorded in a quiet office environment.

A desktop platform was used for speech collection. A manually checked pronunciation lexicon with phonemic transcriptions in en-us_CMU is available. All audio files were manually transcribed and annotated by native transcribers.

For more details, please consult the technical document or contact our sales team.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-190

The Chinese English Speech Recognition Corpus was collected in China.

The script contains approximately 579,049 utterances in total, designed to provide material for both training and testing speech recognizers. Each utterance is stored as an uncompressed waveform in a separate file.

The corpus contains the voices of 2,576 speakers (1,242 male, 1,334 female), balanced across age (18–35), gender, and regional accent. Each speaker was recorded in both quiet and noisy environments.

A mobile platform (Android) was used for speech collection. A manually checked pronunciation lexicon with CMU phonemic transcriptions is available. All audio files were manually transcribed and annotated by native transcribers.

For more details, please consult the technical document or contact our sales team.

King-ASR-337

The Chinese English Speech Recognition Corpus was collected in China.

The script contains approximately 110,066 utterances in total, designed to provide material for both training and testing speech recognizers. Each utterance is stored as an uncompressed waveform in a separate file.

The corpus contains the voices of 401 speakers (200 male, 201 female), balanced across age group (<20, ≥20), gender, and regional accent. Each speaker was recorded in a quiet home environment.

A desktop platform was used for speech collection. A manually checked pronunciation lexicon with CMU phonemic transcriptions is available. All audio files were manually transcribed and annotated by native transcribers.

For more details, please consult the technical document or contact our sales team.

King-ASR-340

The Chinese English Speech Recognition Corpus was collected in China.

The script contains approximately 220,124 utterances in total, designed to provide material for both training and testing speech recognizers. Each utterance is stored as an uncompressed waveform in a separate file.

The corpus contains the voices of 401 speakers (200 male, 201 female), balanced across age group (<20, ≥20), gender, and regional accent. Each speaker was recorded in a quiet home environment.

A mobile platform (Android) was used for speech collection. A manually checked pronunciation lexicon with CMU phonemic transcriptions is available. All audio files were manually transcribed and annotated by native transcribers.

For more details, please consult the technical document or contact our sales team.
