Mar 8, 2018
Yu, Shiwen; Duan, Huiming; Wu, Yunfang, 2018, "Corpus of Multi-level Processing for Modern Chinese", http://doi.org/10.18170/DVN/SEYRX5, Peking University Open Research Data Platform, V1
The Peking University Institute of Computational Linguistics began researching the multi-level processing of modern Chinese in 1992, and annotated the 1998 People's Daily corpus from April 1999 to April 2002. The modern Chinese multi-level processing corpus includes 52 mill...
Jan 12, 2018
Sui, Zhifang; Yu, Shiwen, 2018, "Multi-domain Chinese-English Terminology Database", http://doi.org/10.18170/DVN/PUDSHB, Peking University Open Research Data Platform, V1
Terminology is a condensed form of specialized domain knowledge. In its domain knowledge engineering work, the Peking University Institute of Computational Linguistics has accumulated a number of terminology databases in specialized fields, including: Sports Terminology Datab...
Jan 3, 2018
Yu, Shiwen, 2018, "Knowledge Base of Phrase Structure in Modern Chinese", http://doi.org/10.18170/DVN/NPDNSO, Peking University Open Research Data Platform, V1
The Knowledge Base of Phrase Structure in Modern Chinese contains 676 structural rules for Chinese phrases (including compound words), all of which are context-free grammar rules. Three sample libraries are released this time: 160 rules that contain an adjective, 184 rules that co...
Jan 3, 2018
Yu, Shiwen, 2018, "CLKB Common Material", http://doi.org/10.18170/DVN/XR0STB, Peking University Open Research Data Platform, V1
This dataset holds background material on the CLKB, such as an introduction to the CLKB, award certificates, and information about the authors.