A page relevant to corpora.

Pages

Algemeen Nederlands Woordenboek (ANW) corpus

The Algemeen Nederlands Woordenboek (ANW) corpus is a balanced…

New Model Corpus

The New Model Corpus is a ~100-million-word domain corpus built…

London English corpora

The corpus consists of transcripts of informal conversation-like…

zhTenTen corpus

The Simplified Chinese TenTen corpus was created from the Internet…

yoTenTen corpus

Yoruba TenTen web corpus. The corpus is cleaned by jusText,…

uaTenTen corpus

The Ukrainian TenTen corpus was crawled by SpiderLing in 2014.…

trTenTen corpus

Turkish TenTen corpus. Crawled by SpiderLing in December 2011…

svTenTen corpus

Swedish TenTen web corpus. The corpus is cleaned by jusText,…

skTenTen corpus

Slovak TenTen corpus. The corpus has been tagged by the Ľ.…

ptTenTen corpus

Portuguese TenTen corpus. The corpus is processed with Eckhard…

Norwegian Web corpus (noTenTen)

The Norwegian Web 2015 is a web corpus from TenTen corpora…

nlTenTen corpus

Dutch TenTen web corpus. The corpus is cleaned by jusText,…