Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com

English

人工智能数据资源服务平台 Speechocean Homepage Help

Home > Product search_spontaneous

60 Results

King-ASR-016

The Chinese Mandarin Speech Recognition Corpus was collected in China.

This corpus contains the voices of 50 different speakers (25 males, 25 females) who were balanced distributed in age, gender and regional accents. Each speaker recorded in quiet environment.

Desktop platform was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in pinyin. All data were manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-L-016

This Chinese Mandarin Speech Recognition Corpus contains the voices of 50 different native speakers. Totally 56 topics were specially designed for testing of speech recognizers. Each speakers uttered 11 topics with spontaneous speech and all speech were recorded through one professional microphones in a quiet office. All the speech data was transcribed and labeled.

King-ASR-L-017

This Chinese Mandarin Speech Recognition Corpus contains the voices of 20 different native speakers. It includes 3405 recording files in about 11.52 hours. A set of 166 topics were specially designed for testing of speech recognizers. Each speakers uttered 166 utterances including Stock, Person name, Digit String, and single Chinese sentences, meanwhile one topic was given to each speaker for spontaneous speech. All speech were recorded through one professional microphones in a quiet office. All the speech data was transcribed and labeled.

King-ASR-L-022

This Chinese Mandarin Speech Recognition Corpus was collected in China and contains the voices of 20 different native speakers who were demographic balanced according to age distribution (12-60), gender and regional accents (covering 7 typical dialectical regions with 30 Provinces and regions). Totally 40 topics were specially designed for both training and testing of speech recognizers. Each speakers uttered specific topics spontaneously which were randomly selected from the pool. All speakers were recorded in a quiet office room through two professional microphones. All the speech data was transcribed and labeled.

King-ASR-027

The Chinese Mandarin Speech Recognition Corpus was collected in China.

The corpus contains 1,638(approx.) utterances in total, specially designed to provide materials for both training and testing of speech recognizers. Each utterance wave was stored in a separate file and uncompressed.

This corpus contains the voices of 648 different speakers (339 males, 309 females) who were distributed in different ages(6 – 15, 16 – 30, 31 – 45, 46 – 60), genders and regional accents. Each speaker recorded in quiet environment.

Telephone platform was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in Pinyin. All data were manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-L-027

This Chinese Mandarin Speech Recognition Corpus was collected in China and contains the voices of 20 different native speakers who were demographic balanced according to age distribution (12-60), gender and regional accent. It includes 28 audio files and with 131.7 hours of recording. A set of 40 topics were specially designed for both training and testing of speech recognizers. Each speaker uttered spontaneously based on the topics which were randomly selected from the pool. All speakers were recorded in a quiet office room through recording system on a telephone by dialing the phone. All the speech data was transcribed and labeled.

King-ASR-046

The Korean Speech Recognition Corpus was collected in China.

The script contains 289 groups of daily spontaneous conversational speech in total, specially designed to provide materials for both training and testing of speech recognizers.

This corpus contains the voices of 537 different speakers (253 males and 284 females) who were balanced distributed in age (18 – 35, 36 – 45, >45), gender and regional accents. Each speaker was recorded in quiet environment.

Mobile platform, i.e. Android was used for speech collection. A pronunciation lexicon is available with a phonemic transcription in KO-KR_RR. All manually checked. All audio files were manually transcribed and annotated by native transcribers.

For more details, please check the technical document or ask our sales people.

Contact Information:
Phone: +86-10-62660053
Email: contact@speechocean.com

King-ASR-L-055

This Japanese Speech Recognition Corpus was collected in Japan and contains the voices of 20 different native speakers who were demographic balanced according to age distribution (18-35, 36-45, 46-60), gender, and dialectical regions. The corpus contains 100 pairs of spontaneous dialog speech data which were from 20 speakers. Each pair of speech consists of 3 audio files: two of them from single speaker separately and the other is from the mixed channel. The three files were recorded simultaneously. The pure recording time of mixed channel is about 0.78 hours. 33 topics were contained in this database. There are 679 audio files which were saved as uncompressed PCM files. All the speech data was transcribed and labeled.

King-ASR-L-111

This is a Mandarin speech database, which is collected in China over Roland R-44 recorder with headset microphone in the environment of quiet office. The corpus contains the 20 speakers of spontaneous dialog speech. The pure recording time is about 10.27 hours. 25 topics were contained in this database. The total size of this database is 3.03G. All audio files were manually transcribed and annotated.

King-ASR-L-117

This Japanese Speech Recognition database was collected in Japan and contains the voices of 20 different native speakers who were demographic balanced according to age distribution (16-30, 31-45, 46+), Gender and Dialectical Regions. The script was specially designed to provide material for both training and testing of many classes of speech recognizers. It contains 300 utterances covering 15 categories and 35 sub-categories recorded by each speaker (for the detail script and the structure design, please see the technical document). Each speaker recorded under two environments, a quiet session (Office/Home) and a noisy session (Garden/roadside/restaurant/bus). Every speaker recorded 150 utterances and spontaneous sentences per environment with a total of 300 utterances. Popular mobiles in the country were used for collecting this data such as Samsung, Nokia, HTC etc. The speech data is stored as sequences of 16 kHz, 16 bit and uncompressed. Each utterance is stored in a separate file and each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.

1 2 3 4 5 6