A page relevant to corpora.

Pages

elTenTen (Greek) corpus

Greek TenTen web corpus. The corpus has not been tagged yet. Structural…

deTenTen corpus

German TenTen corpus. The corpus is double-tagged with RFTagger…

daTenTen corpus

Danish TenTen web corpus. The corpus is cleaned by jusText,…

czTenTen corpus

Czech TenTen family web corpus crawled by SpiderLing in 2011…

caTenTen corpus

Catalan TenTen web corpus crawled in February and March 2014. Structural…

bgTenTen corpus

Bulgarian TenTen corpus crawled by SpiderLing in November 2012.…

arTenTen corpus

The Arabic web corpus of the TenTen family was crawled by SpiderLing…