Skip to main content
Comprehensive Language Knowledge Base (Institute of Computational Linguistics)
Share Dataverse

Share this dataverse on your favorite social media networks.

The Comprehensive Language Knowledge Base (CLKB) was built by Peking University Institute of Computational Linguistics since 1986. CLKB includes 6 language knowledge base, 10 specifications and standards, basic software tools and four application systems, which support each other to form an organic whole. CLKB series of language knowledge covers words, phrases, sentences, chapters of the various units and lexical, syntactic, semantic aspects, from Chinese to multi-language radiation, from the general field into the professional field.
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Find Advanced Search

1 to 10 of 40 Results
Mar 8, 2018
Yu, Shiwen; Duan, Huiming; Wu, Yunfang, 2018, "Corpus of Multi-level Processing for Modern Chinese", http://doi.org/10.18170/DVN/SEYRX5, Peking University Open Research Data Platform, V1
Peking University Institute of Computational Linguistics began to research the multi-level processing of the modern Chinese from 1992, and annotated corpus of the People's Daily, 1998 from April 1999 to April 2002. The modern Chinese multi-level processing corpus includes 52 mill...
Adobe PDF - 246.2 KB - MD5: 12cd6ab326e925a12412dc149f2235e5
Adobe PDF - 931.7 KB - MD5: fde534f5dc9d1161758fdad0a67765bd
JPEG Image - 636.9 KB - MD5: df2edebe8faf91de2f2e94f181b9598d
Adobe PDF - 1.3 MB - MD5: 8026bc7b49d4b6a86e895cb1ade60736
Unknown - 2.1 MB - MD5: 993d77248bf3b92e1c8f3f10bb528077
Adobe PDF - 155.4 KB - MD5: 963b5060a5430ecceb9d594770f1fb7e
Unknown - 702.9 KB - MD5: 391f61fd312fa0afa272422aac52b803
Jan 12, 2018
Sui, Zhifang; Yu, Shiwen, 2018, "Multi-domain Chinese-English Terminology Database", http://doi.org/10.18170/DVN/PUDSHB, Peking University Open Research Data Platform, V1
Terminology is a condensed form of specialized domain knowledge. In the practices of domain knowledge engineering, Peking University Institute of Computational Linguistics has accumulated a number of terminology databases in specialized fields, including: Sports Terminology Datab...
Add Data

Sign up or log in to create a dataverse or add a dataset.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.

Contact Peking University Open Research Data Platform Support

Peking University Open Research Data Platform Support

Please fill this out to prove you are not a robot.

+ =
Send Message