You are here:Home/English Web corpus (enTenTen corpus)
enTenTen: Corpus of the English Web
The English WebCorpus (enTenTen) is a text corpus created from the collected internet texts. The corpus belongs to the TenTen corpus family which is a set of the same processed web corpora with the target size 10+ billion words. Sketch Engine currently provides access to Tenten corpora in more than 30 languages.