Welcome to KingLine Data Center!   Contact Phone: 0086-10-62660053   Email: marketing@speechocean.com

English

人工智能数据资源服务平台 Speechocean Homepage Help

Home > Academic Resources > NLP-Corpus

Language

>

All

Sort By:Default

20 Results

King-NLP-L-001

This corpus contains 10,000 Chinese sentences which collected from the real emails of Chinese native speakers. All words were proofread manually and all the sensitive words and repeated sentences were filtered in the pure word layer.
*Only for domestic market.

Credits: 300.00

King-NLP-L-002

This data contains 1,000 SMS sentences collected from the real life of Chinese native speakers. All short message sentences were proofread manually, repeated sentences were filtered, etc. The whole data is formed by four layers of pure word, Pinyin with tone, word segmentation and name entity.
*Only for domestic market.

Credits: 60.00

King-NLP-L-003-1

This data contains 10,000 SMS sentences collected from the real life of Chinese native speakers. All short message sentences were proofread manually, repeated sentences were filtered in the pure word layer and all the sentences were annotated with Pinyin with tone information.
*Only for domestic market.

Credits: 240.00

King-NLP-L-003-2

This data contains 40,000 SMS sentences collected from the real life of Chinese native speakers. All short message sentences were proofread manually, repeated sentences were filtered in the pure word layer and all the sentences were annotated with Pinyin with tone information.
*Only for domestic market.

Credits: 960.00

King-NLP-L-004-1

This data contains 10,000 SMS sentences collected from the real life of Chinese native speakers. All short message sentences were proofread manually, repeated sentences were filtered in the pure word layer and all the sentences were annotated with word segmentation information.
*Only for domestic market.

Credits: 200.00

King-NLP-L-004-2

This data contains 40,000 SMS sentences collected from the real life of Chinese native speakers. All short message sentences were proofread manually, repeated sentences were filtered in the pure word layer and all the sentences were annotated with word segmentation information.
*Only for domestic market.

Credits: 800.00

King-NLP-L-005

This data contains 10,000 SMS sentences collected from the real life of Chinese Native speakers. All short message sentences were proofread manually, repeated sentences were filtered in the pure word layer and all the sentences were annotated with name entity information.
*Only for domestic market.

Credits: 200.00

King-NLP-L-006-1

This data contains 10,000 SMS sentences collected from the real life of Chinese Natives. All short message sentences were proofread manually, repeated same sentences were filtered, etc. ;The whole data is formed of four layers.
*Only for domestic market.

Credits: 200.00

King-NLP-L-006-2

This data contains 50,000 SMS sentences collected from the real life of Chinese Natives. All short message sentences were proofread manually, repeated same sentences were filtered, etc. The whole data is formed of four layers.
*Only for domestic market.

Credits: 1000.00

King-NLP-L-007

This corpus contains 10,000 Chinese characters collected from the real instance messages of Chinese Natives. All words were proofread manually, sensitive words, repeated same sentences were filtered in the pure word layer.
*Only for domestic market.

Credits: 200.00

1 2