The Caja corpus is a balanced corpus of Academic Journal Articles created by Iztok Kosem in 2010. For more information, see his PhD thesis (below). The corpus contains 79 million words and consists of 13,116 articles from 28 different disciplines.

Availability

Access to the corpus is restricted. For more information, please contact us at support@sketchengine.co.uk.

Reference

Kosem, Iztok. Designing a model for a corpus-driven dictionary of Academic English. PhD Thesis. Aston University, 2010.

Explore other domain corpora in Sketch Engine

See a list of corpora prepared from specific domains in Sketch Engine.
