Pages

Chinese Tagset

A preview of a Chinese tagset. 普通名词 n common…

Internet-ZH corpus

Internet-ZH is a Chinese web corpus collected by Serge Sharoff.…

ChineseWiki corpus

The Chinese Wiki corpus is first segmented with Stanford Word…

ChineseTaiwanWaC corpus

The Chinese Taiwan Web as Corpus (ChineseTaiwanWaC) has almost 260 million words encoded…

zhTenTen corpus

The Simplified Chinese TenTen corpus was created from the Internet…