Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com

English

Speechocean HomepageHelp

Home > Commercial Resources

Recording Platform

>

All

Language

>

All

Sort By:Default

516 Results

King-IM- 001

-800 Individuals, including Yellow(300), Black(200) and White(300)
-2 Shooting Angles: 0-7 degree & 7-14 degree
-46 Images/Individual, 36800 Images in total
-4 Emotional Expressions: Normal/Happy/Sad/Angry
-6 Mouth Movements: Normal; Mouth Open; Mouth Open; O Type; Mouth Honk; Mouth Toothless
-2 Types of Eye Brows Movements: Rais; Frown
-2 Types of Eye Movements: Eyes Open; Eyes Closed
-9 Head's Orientations: Upper-Left; Upper-Front; Upper-Right; Left; Front; Right; Lower-Left; Lower-Front; Lower-Right
-Format: MP4/AVI

King-AVT-001

This database contains 300 hours of speech which was transcribed and annotated from the audios collected TV shows, Talk Shows, entertainment programs etc.

King-TTS-001

The Chinese Mandarin Speech Synthesis Corpus contains the recordings of 1 voice talent (female, professional broadcaster, 30 years old, born and raised in Beijing). It is recorded in a professional studio over two channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 12,679 (approx.) utterances in total. There are 20 categories in this corpus, which contains the most common language phenomenon in Chinese Mandarin. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance wave is stored in Windows uncompressed PCM format. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded.

King-NLP-001

This corpus contains 3,050,000 Chinese characters collected from the real Emails of Chinese native speakers. All words were proofreaded manually and all the sensitive words and repeated sentences were filtered in the pure word layer.

King-ASR-001

The Chinese Mandarin Mobile Speech Recognition Corpus was collected in China. It contains the voices of 265 different speakers (134 males, 131 females) who were balanced distributed in age (mainly 16 – 28,29 – 45), gender and regional accents (for the details, please see the technical document). The script contains 13,942 (approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Mobile platform, i.e. Windows 2000 was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-AVT-002

This database contains 100 hours of speech which was transcribed and annotated from home videos collected from real life of Chinese native speakers.

King-TTS-002

The Chinese Mandarin Speech Synthesis Corpus contains the recordings of 1 voice talent (female, professional broadcaster, 30 years old, born and and raised in Beijing). It is recorded in a professional studio over two channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 6830 (approx.) utterances in total. There are 8 categories in this corpus, which contains the most common language phenomenon in Chinese Mandarin. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance wave is stored in Windows uncompressed PCM format. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded.

King-NLP-002

This data contains 100,000 SMS sentences collected from the real life of Chinese native speakers. All short message sentences were proofreaded manually, repeated sentences were filtered, etc.; The whole data is formed by four layers of Pure word, Pinyin with Tone, Word Segmentation and Name Entity.

King-ASR-002

The Chinese Mandarin Telephone Speech Recognition Corpus was collected in China. It contains the voices of 285 different speakers (144 males, 141 females) who were balanced distributed in age (mainly 16 – 28, 29 – 45), gender and regional accents (for the details, please see the technical document). The script contains 14492 (approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet enviroments.Desktop platform, i.e. Windows 2000 was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. Pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

King-IM-002

-2500 Individuals, Chinese (1200), Korean(1300)
-Recorded by 3 Cameras, 12 Images/Camera/Individual, 90,000 Images in Total
-4 Emotional Expressions: Normal/Happy/Sad/Angry
-6 Mouth Movements: Normal, Mouth Open, Mouth Open, O Type, Mouth Honk, Mouth Toothless
-2 Types of Eye Brows Movements: Rais, Frown
-Format: JPG, 800*600

1 2 3 4 5 6 7 8 9 10