A page relevant to corpora.

Pages

huTenTen corpus

Hungarian web corpus was crawled by SpiderLing in June 2012.…

heTenTen corpus

Hebrew TenTen web corpus The corpus is cleaned by jusText,…

fiTenTen corpus

Finnish TenTen web corpus crawled by SpiderLing in February…

etTenTen corpus

Estonian web corpus was crawled by SpiderLing in 2013. It was…

elTenTen (Greek) corpus

Greek TenTen web corpus. The corpus has not been tagged yet. Structural…

daTenTen corpus

Danish TenTen web corpus. The corpus is cleaned by jusText,…

czTenTen corpus

Czech TenTen family web corpus crawled by SpiderLing in 2011…

caTenTen corpus

Catalan TenTen web corpus crawled in February and March 2014. Structural…