Hausa WaC (web as corpus) is prepared by Corpus factory method described here. Full details are described in Kilgarriff et al. at LREC 2010. The corpus contains 5.3 million words and has not word sketches. The access to the corpus is restricted.

Changelog

v 1.0 (June 23, 2015)

  • initial version