Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com

English

Speechocean HomepageHelp

Home > Commercial Resources > TTS-Corpus

Recording Platform

>

All

Language

>

All

Sort By:Default

33 Results

King-TTS-001

The Chinese Mandarin Speech Synthesis Corpus contains the recordings of 1 voice talent (female, professional broadcaster, 30 years old, born and raised in Beijing). It is recorded in a professional studio over two channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 12,679 (approx.) utterances in total. There are 20 categories in this corpus, which contains the most common language phenomenon in Chinese Mandarin. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance wave is stored in Windows uncompressed PCM format. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded.

King-TTS-002

The Chinese Mandarin Speech Synthesis Corpus contains the recordings of 1 voice talent (female, professional broadcaster, 30 years old, born and and raised in Beijing). It is recorded in a professional studio over two channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 6830 (approx.) utterances in total. There are 8 categories in this corpus, which contains the most common language phenomenon in Chinese Mandarin. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance wave is stored in Windows uncompressed PCM format. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded.

King-TTS-003

The Chinese Mandarin Speech Synthesis Corpus contains the recordings of 1 voice talent (female, professional broadcaster, 28 years old, born and and raised in Beijing). It is recorded in a professional studio over 2 channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 19,509 (approx.) utterances in total. There are 28 categories in this corpus, which contains the most common language phenomenon in Chinese Mandarin. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance wave is stored in Windows uncompressed PCM format. Prosody Labeling and Syllable Labeling were adopted based on the real speech. A pronunciation lexicon is available with a phone set in Pinyin. All manually checked. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded.

King-TTS-004

The Arabic Speech Synthesis Corpus contains the recordings of 1 voice talent (male, professional broadcaster, 52 years old, born and and raised in Cairo). It is recorded in a professional studio over 2 channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 8058 (approx.) utterances in total. There are 12 categories in this corpus, which contains the most common language phenomenon in Arabic. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance wave is stored in Windows uncompressed PCM format. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded.

King-TTS-005

The Arabic Speech Synthesis Corpus contains the recordings of 1 voice talent (male, professional broadcaster, 52 years old, who was born and raised in Cairo). It is recorded in a professional studio over 2 channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 8059 (approx.) utterances in total. There are 12 categories in this corpus, which contains the most common language phenomenon in Arabic. Each utterance wave is stored in Windows uncompressed PCM format. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded.

King-TTS-006

The British English Speech Synthesis Corpus contains the recordings of 1 voice talent (female, professional broadcaster, 32 years old, born and and raised in London). It is recorded in a professional studio over 2 channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 10,146 (approx.) utterances in total. There are 31 categories in this corpus, which contains the most common language phenomenon in British English. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance wave is stored in Windows uncompressed PCM format. Manual Prosody labeling, Stress Labeling, Phone Boundary Labeling were adopted based on the real speech. A pronunciation lexicon is available with a phone set. All manually checked. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded."

King-TTS-007

The British English Speech Synthesis Corpus contains the recordings of 1 voice talent (male, professional broadcaster, 32 years old, born and and raised in London). It is recorded in a professional studio over 2 channels (Shure SM15 microphone and Electroglottography (EGG) sensor). The script contains 10126 (approx.) utterances in total. There are 31 categories in this corpus, which contains the most common language phenomenon in British English. Speech rate consistency, energy consistency and tone color consistency are strictly controlled during recording process. Each utterance wave is stored in Windows uncompressed PCM format. Manual Prosody labeling, Stress Labeling, Phone Boundary Labeling were adopted based on the real speech. A pronunciation lexicon is available with a phone set in BEEP. All manually checked. Each utterance was carefully proofread by linguists, to make sure the phonetic symbol is consistent with the real pronunciation. Misread utterances have been re-recorded.

King-TTS-008

King-TTS-009

King-TTS-010

1 2 3 4