A page relevant to corpora.

Pages

FinnishWaC corpus

Finnish web as corpus.

danishWaC corpus

The corpus prepared by Corpus factory method. It has 288 million…

ScienceBlog corpus

The ScienceBlogs corpus is a selection of posts and comments…

e-flux corpus

The e-flux corpus is a web corpus of English art news digests.…

Corpus TECU – Geodetics web corpus

(information in Czech language) Tvorba specializovaných dat…

Environment corpus

English environment related web corpus. Crawled by SpiderLing…

Filipino web corpus (FilipinoWaC)

The corpus was created by Anil in October 2013. It has almost…

Nineteenthcentury corpus

Actually, the 19th century corpus is only available to Osnabrück…

Penn Historical Corpora

Penn Historical Corpora is a collection of historical English…