Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com

English

Speechocean HomepageHelp

Home > Commercial Resources > ASR-Corpus

Recording Platform

>

All

Language

>

All

Sort By:Default

31 Results

King-ASR-034

 The Chinese Mandarin In-Car Speech Recognition Corpus was collected in China. It contains the voices of 20 different speakers (10 males, 10 females). The script contains 19,198(approx.) utterances in total, covering 4 categories (for more details of script structure design,please check the specification), specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in 3 environments (0km/h, 30km/h, 60km/h) with 2 recording conditions (motor running, window up/down). 2 high quality audio channels were used in a car. The recording software (AUDIOREC) was developed by Speechocean. Each utterance wave was stored in a separate file as uncompressed PCM raw data. Each file was accompanied by an ASCII SAM label file which contains the relevant descriptive information. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with Specification.

King-ASR-120

The Chinese Mandarin In-Car Speech Recognition Corpus is collected in Beijing. It contains the voices of 160 different speakers (80males, 80 females) who were balanced distributed in age (mainly 16 – 30, 31 – 45, 46 – 65), gender and regional accents (for the details, please see the technical document). The script contains 304,244(approx.) utterances in total, covering 16 categories and  42 sub-categories (for more details of script structure design,please check the specification), specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in 3 environments out of 5 optional environments (STOP_MOTOR_RUNNING, LOW_SPEED_ROUGH_ROAD, HIGH_SPEED_GOOD_ROAD, etc.) with various recording conditions (motor running, fan on/off, window up/down and etc). 4 high quality audio channels (Shure SM10A / Sennheiser ME104 /AKG Q400 /AKG Q400) were used in a car, at least 2 popular car models were adopted. The recording software (AUDIOREC) is developed by Speechocean. Each utterance wave was stored in a separate file as uncompressed PCM raw data. Each file is accompanied by an ASCII SAM label file which contains the relevant descriptive information. A pronunciation lexicon is available with a phonemic transcription in Pinyin. All manually checked; All audio files were manually transcribed and annotated by native transcribers. Details are available with Specification. The corpus follows the general convention of SpeechDat-Car.

King-ASR-120-1

The Chinese Mandarin In-Car Speech Recognition Corpus was collected in China. It contains the voices of 1,200 different speakers who were balanced distributed in age (mainly 16 – 30,31 – 45,46 – 60), gender and regional accents (for the details, please see the technical document). The script contains 514,028 (approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each utterance wave was stored in a separate file and uncompressed. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-ASR-121

This Korean In-car speech corpus was collected in Korea and contains the voices of 1000 different native speakers who were demographically balanced according to age (16-30, 31-45, 46-60), gender (500 males, 500 females) and dialectical regions.
The script was specially designed to provide material for both training and testing of many classes of speech recognizers which contains 16 general categories and more than 50 specific sub-categories. Each speaker was recorded under three driving environments (parked, city driving and highway driving) with recording conditions such as fan on/off and window up/down. A total of 300 utterances were recorded for each speaker in two of three driving environments (150 utterances and 10 spontaneous utterances per environment). Four high quality audio channels (C1: SHURE SM10A, C2: SENNHEISER ME104, F1: AKG Q400, F2: AKG Q400) and two popular cars in the country were used for this recording.
The speech data is stored as sequences of 16 kHz, 16 bit and is uncompressed and each prompted utterance is stored in a separate file and each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.A pronunciation lexicon with a phonemic transcription in SAMPA is also included. All the data was transcribed and labeled.

King-ASR-122

The Chinese Mandarin In-Car Speech Recognition Corpus is collected in Beijing. It contains the voices of 100 different speakers (50 males, 50 females) who were balanced distributed in age (mainly 18~30, 31~45, 46~60), gender and regional accents (for the details, please see the technical document). The script contains 200,796(approx.) utterances in total, covering 6 categories and 31 sub-categories (for more details of script structure design,please check the specification), specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in 2 environments out of 6 optional environments (HIGH_SPEED_GOOD_ROAD, LOW_SPEED_COUNTRY_ROAD, TOWN_TRAFFIC, etc) with various recording conditions (motor running, fan on/off, window up/down and etc.). 4 high quality audio channels (Shure SM10A / AKG C400 BL / AKG C400 BL / AKG C400 BL) were used in a car, at least 2 popular car models were adopted. The recording software (AUDIOREC) is developed by Speechocean. Each utterance wave was stored in a separate file as uncompressed PCM raw data. Each file is accompanied by an ASCII SAM label file which contains the relevant descriptive information. A pronunciation lexicon is available with a phonemic transcription in Pinyin. All manually checked; All audio files were manually transcribed and annotated by native transcribers. Details are available with Specification. The corpus follows the general convention of Speech Dat-Car.

King-ASR-125

The Japanese In-Car Speech Recognition Database is collected in Japan. It contains the voices of 309 different native speakers (134males, 175 females) who were balanced distributed in age (mainly 16 – 30,31 – 45,46 – 65), gender and regional accents (for the details, please see the technical document). The script contains 98,981(approx.) utterances in total, 333.3 utterances for each speakers, covering 18 categories and 55 sub-categories (for more details of script structure design,please check the specification), specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in 2 environments out of 7 optional environments (STOP_MOTOR_RUNNING, LOW_SPEED_ROUGH_ROAD,  HIGH_SPEED_GOOD_ROAD, etc) with various recording conditions (motor running, fan on/off, window up/down and etc). 4 high quality audio channels Shure SM10A / Sennheiser ME104 /AKG C400BL/AKG C400BL were used in a car, at least 3 popular car models were adopted. The recording software (AUDIOREC) is developed by Speechocean. Each utterance wave was stored in a separate file as uncompressed PCM raw data. Each file is accompanied by an ASCII SAM label file which contains the relevant descriptive information. All audio files were manually transcribed and annotated by native transcribers. Details are available with Specification. The corpus follows the general convention of SpeechDat-Car.

King-ASR-125-1

This Japanese in-car speech corpus was collected in Japan and contains the voices of 800 different native speakers who were demographically balanced according to Age (16-30, 31-45, 46-60), Gender (400 males, 400 females) and Dialectical Region. The script was specially designed to provide material for both training and testing of many classes of speech recognizers which contains 16 general categories and more than 50 specific sub-categories. Each speaker was recorded under three driving environments (parked, city driving and highway driving) with recording conditions such as fan on/off and window up/down. A total of 300 utterances were recorded for each speaker in two of three driving environments (150 utterances and 10 spontaneous utterances per environment). Four high quality audio channels (C1: SHURE SM10A, C2: SENNHEISER ME104, F1: AKG Q400, F2: AKG Q400) and three popular cars in the country were used in this recording. The speech data is stored as sequences of 16 kHz, 16 bit and is uncompressed and each prompted utterance is stored in a separate file and each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included. All the data was transcribed and labeled.

King-ASR-129

The Canada French In-Car Speech Recognition Corpus is collected in Montreal of Canada. It contains the voices of 328 different speakers (164males, 164 females) who were balanced distributed in age (mainly 16 – 30,31 – 45,46 – 60,>60), gender and regional accents (for the details, please see the technical document). The script contains 361,560(approx.) utterances in total,  covering 17 categories and 41 sub-categories (for more details of script structure design,please check the specification), specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in 2 environments out of 7 optional environments (car stopped by motor running, car in town traffic noisy conditions, country road condition, etc.) with various recording conditions (motor running, fan on/off, window up/down and etc). 4 high quality audio channels (Shure SM10A / Sennheiser ME104 /AKG Q400/AKG Q400) were used in a car, at least 3 popular car models were adopted. The recording software (AUDIOREC) is developed by Speechocean. Each utterance wave was stored in a separate file as uncompressed PCM raw data. Each file is accompanied by an ASCII SAM label file which contains the relevant descriptive information. All audio files were manually transcribed and annotated by native transcribers. Details are available with Specification. The corpus follows the general convention of SpeechDat-Car.

King-ASR-131

The US English In-Car Speech Recognition Corpus is collected in USA. It contains the voices of 305 different  speakers (153 males, 152 females) who were balanced distributed in age (mainly 18 – 30, 31 – 45, 46 – 65), gender and regional accents (for the details, please see the technical document). The script contains 385,084(approx.) utterances in total, covering 16 categories and 52 sub-categories (for more details of script structure design,please check the specification), specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in 1-2 environments out of 7 optional environments (parked, town traffic, country road, etc.) with various recording conditions (motor running, fan on/off, window up/down and etc). 4 high quality audio channels (Shure SM10A / Sennheiser ME104 / AKG C400BL / AKG C400BL) were used in a car, at least 3 popular car models were adopted. The recording software (AUDIOREC) is developed by Speechocean. Each utterance wave was stored in a separate file as uncompressed PCM raw data. Each file is accompanied by an ASCII SAM label file which contains the relevant descriptive information. A pronunciation lexicon is available with a phonemic transcription in CMU phoneset. All manually checked; All audio files were manually transcribed and annotated by native transcribers. Details are available with Specification. The corpus follows the general convention of SpeechDat-Car.

King-ASR-132

The France French in-car speech recognition database was collected in France and contains the voices of 300 different native speakers (150 males, 150 females) who were balanced according to age (mainly 16-30, 31-45, 46-60), gender and regional accents (for the details, please see the technical document). The script was specially designed to provide material for both training and testing of many classes of speech recognizers which contains 16 general categories and more than 50 specific sub-categories. Each speaker was recorded under two of three driving environments (parked, city driving and highway driving) with recording conditions such as fan on/off and window up/down. Each speaker will be recorded 150 utterances and spontaneous utterances per environment and totally 300 utterances were recorded for each speaker. Four high quality audio channels (C1: SHURE SM10A, C2: SENNHEISER ME104, F1: AKG Q400, F2: AKG Q400) and at least two popular cars in the country were used. ; The speech data is stored as sequences of 16 kHz, 16 bit and is uncompressed and each prompted utterance is stored in a separate file and each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included. All the data was transcribed and labeled.

1 2 3 4