A page relevant to corpora.

Pages

daTenTen corpus

Danish TenTen web corpus. The corpus is cleaned by jusText,…

czTenTen corpus

Czech TenTen family web corpus crawled by SpiderLing in 2011…

caTenTen corpus

Catalan TenTen web corpus crawled in February and March 2014. Structural…

bgTenTen corpus

Bulgarian TenTen corpus crawled by SpiderLing in November 2012.…