Pages

Tagset reference for RFTagger, German

RFtagger tags are listed below. They are largely more detailed…

SDeWaC corpus

SDeWaC is a subset of DeWaC. The creation of sDeWaC is described…

German Web Corpus (DeWaC)

The corpus was prepared by Marco Baroni in a web crawl as described…

GerManC. A Historical Corpus of German Newspapers 1650–1800

GerManC is a historical corpus of written German texts. (This…

deTenTen corpus

German TenTen corpus. The corpus is double-tagged with RFTagger…