Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com

English

人工智能数据资源服务平台 Speechocean Homepage Help

Home > Commercial Resources > ASR-Corpus

Recording Platform

>

All

Language

>

All

Sort By:Default

18 Results

King-ASR-040

The American English Speech Recognition Corpus was collected in America.

The corpus contains 39,243(approx.) utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 151 different speakers (74 males, 77 females) who were balanced distributed in age (18 – 35, 36 – 45, 46 – 60), gender and regional accents. Each speaker recorded in quiet and noisy environments.

Mobile platform, i.e. Android, was used for speech collection. A pronunciation lexicon is available with a phonemic transcription. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-056

The American English Speech Recognition Corpus was collected in USA.

The corpus contains 29,976 (approx.) utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 50 different speakers (23 males, 27 females) who were balanced distributed in age (16 – 30, 31 – 45, 46 – 60), gender and regional accents. Each speaker recorded in quiet and noisy environments.

Desktop platform was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in SAMPA. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-066

The American English Speech Recognition Corpus was collected in USA.

The script contains 829,631(approx.) utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 2,602 different speakers (1,232 males, 1,370 females) who were balanced distributed in age (16 – 30, 31 – 45, >45), gender and regional accents. Each speaker was recorded in quiet office and home environment.

Mobile platform, i.e. Android was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in CMU. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-081

The American English Speech Recognition Corpus was collected in USA.

The script contains 14,988(approx.) utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 50 different speakers (23 males, 27 females) who were balanced distributed in age (16 – 30, 31 – 45, 46 – 60), gender and regional accents. Each speaker was recorded in quiet and noisy environments.

Mobile platform, i.e. Android was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in en-us_CMU. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-090

The American English Speech Recognition Corpus was collected in USA.

The corpus contains 49,990(approx.) utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 50 different speakers (25 males, 25 females) who were balanced distributed in age (16 – 30, 31 – 45, 46 – 60), gender and regional accents. Each speaker recorded in quiet office environment.

Desktop platform was used for speech collection. A pronunciation lexicon is available with a phonemic transcription. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-131

The American English Speech Recognition Corpus was collected in USA.

The script contains 383,788(approx.) utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 304 different speakers (152 males, 152 females) who were balanced distributed in age (mainly 18 – 30,31 – 45,46 – 65), gender and regional accents. Each speaker was recorded in 1 or 2 different environments taking among 7 possible environments (STOP_MOTOR_RUNNING, LOW_SPEED_ROUGH_ROAD, HIGH_SPEED_GOOD_ROAD, and etc.).

3 kinds of vehicle (MAZDA 6, HONDA ACCORD and HONDA CRV) and 3 kinds of Microphone (Shure SM10A / Sennheiser ME104 / AKG C400BL) were used when recording. 4 high quality audio channels were used for speech collection.

A pronunciation lexicon is available with a phonemic transcription in CMU phone set. All audio files were manually transcribed and annotated by native transcribers. The corpus follows the general convention of SpeechDat-Car.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-139

The American English Speech Recognition Corpus was collected in USA.

The corpus contains 76,595(approx.) utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 252 different speakers (135 males, 117 females) who were balanced distributed in age, gender and regional accents. Each speaker recorded in quiet and noisy environments.

Mobile platform, i.e. Android, was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in CMU. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-200

The American English Speech Recognition Corpus was collected in USA.

The corpus contains 102,574 utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 201 different speakers (113 males, 88 females) who were balanced distributed in age (16 – 30, 31 – 45, >45), gender and regional accents. Each speaker recorded in quiet and home environments.

Desktop platform was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in CMU. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-213

The American English Speech Recognition Corpus was collected in USA.

The corpus contains 119,318 utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 200 different speakers (102 males, 98 females) who were balanced distributed in age (16 – 30, 31 – 45, 46 – 60), gender and regional accents. Each speaker recorded in quiet office environment.

Desktop platform was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in SAMPA. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-215

The American English Speech Recognition Corpus was collected in USA.

The corpus contains 241,024 utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 201 different speakers who were balanced distributed in age (18 – 30, 31 – 45, 46 – 60), gender and regional accents. Each speaker recorded in quiet office environment.

Desktop platform was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in SAMPA. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

1 2