The English Wikipedia corpus was built using English Wikipedia dump (from the second half of September 2014). The XML was converted using WikiExtractor.py. The corpus contains more than 1.3 billion words.

Search the English Wikipedia corpus

Sketch Engine offers a range of tools to work with the English Wikipedia corpus.

or