Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com

English

人工智能数据资源服务平台 Speechocean Homepage Help

Home > Academic Resources

Recording Platform

>

All

Language

>

All

Sort By:Default

251 Results

King-IM-Package

The image database package contains:

1. Web-crawling facial database---100,000 images
Covering 3 skin color, about 3500 individuals at all ages.

2. 21 points annotation facial database---10,000 images
100 individuals, 100 images/person. Gender and age evenly distributed.

3. OCR database---2,000 images
English menu, annotated by word level.
English guide board, annotated by word level.

4. Vehicle feature database---21,000 images
7 features annotation, including driver, shotgun seatbelt, decoration and etc. 3,000 images per feature.

Credits: 1000.00

King-NLP-L-001

This corpus contains 10,000 Chinese sentences which collected from the real emails of Chinese native speakers. All words were proofread manually and all the sensitive words and repeated sentences were filtered in the pure word layer.
*Only for domestic market.

Credits: 300.00

King-ASR-L-001

This Chinese Mandarin Speech Recognition Corpus contains the voices of 20 different native speakers (10 males, 10 females) who were balanced according to age (mainly 16-28, 29-45), gender and regional accents. A set of person names and place names were specially designed for both training and testing of speech recognizers. It includes 959 files with about 1.18 recording hours. Each speaker was recorded in a quiet environment. All the speech data was transcribed and labeled.

Credits: 86.00

KDC-TTS-001

The Female Chinese TTS Corpus contains the read speech of 1 native Chinese Female professional broadcaster recorded in a studio with high SNR (>35dB) over Two Channels (AKG C4000B microphone and Electroglottography (EGG) sensor).
This corpus contains 500 utterances.

Credits: 2000.00

King-NLP-L-002

This data contains 1,000 SMS sentences collected from the real life of Chinese native speakers. All short message sentences were proofread manually, repeated sentences were filtered, etc. The whole data is formed by four layers of pure word, Pinyin with tone, word segmentation and name entity.
*Only for domestic market.

Credits: 60.00

King-ASR-L-002

This Chinese Mandarin Speech Recognition Corpus was collected in China and contains the voices of 20 different native speakers were balanced according to age (mainly 16–28, 29–45), gender and regional accents (26 Provincial cities and regions). The Corpus contains 899 audio files. All the speech data was transcribed and labeled.

Credits: 65.00

KDC-TTS-002

The Female Chinese TTS Corpus contains the read speech of one native Chinese Female professional broadcaster recorded in a studio with high SNR (>35dB) over Two Channels (AKG C4000B microphone and Electroglottography (EGG) sensor).
This TTS corpus contains 500 utterances.

Credits: 2000.00

King-ASR-L-003

This Chinese Mandarin Speech Recognition Corpus was collected in China and contains the voices of 265 different native speakers (134 males, 131 females) who were balanced according to age (mainly 16–28,29–45), gender and regional accents (26 province and regions were covered). A set of 5,309 digital strings were specially designed for both training and testing of speech recognizers. 201 speakers uttered 30 digital strings, 64 speakers uttered 25 digital strings. Each speaker was recorded in a quiet environment. All the speech data was transcribed and labeled.

Credits: 600.00

KDC-TTS-003

The Male German TTS Corpus contains the read speech of two native German male professional broadcaster recorded in a studio with high SNR (>35dB) over Two Channels (AKG C4000B microphone and Electroglottography (EGG) sensor) for M1.
This TTS corpus contains 1000 utterances.

Credits: 4000.00

King-NLP-L-003-1

This data contains 10,000 SMS sentences collected from the real life of Chinese native speakers. All short message sentences were proofread manually, repeated sentences were filtered in the pure word layer and all the sentences were annotated with Pinyin with tone information.
*Only for domestic market.

Credits: 240.00

1 2 3 4 5 6 7 8 9 10