Pages

SDeWaC corpus

SDeWaC is a subset of DeWaC. The creation of sDeWaC is described…

German Web Corpus (DeWaC)

The corpus was prepared by Marco Baroni in a web crawl as described…

GerManC. A Historical Corpus of German Newspapers 1650–1800

GerManC is a historical corpus of written German texts. (This…

deTenTen corpus

German TenTen corpus is a corpus from the TenTen class of corpora…