DOAJ corpora – Directory of Open Access Journals
The Directory of Open Access Journals (DOAJ) corpora are text corpora comprised of journals covering all areas of science, technology, medicine, social science, and humanities in dozens of languages.
The DOAJ corpora contain rich metadata about journals, such as title, country, year of publication, etc. It is also possible to search by the keywords of articles.
Detailed information about Directory of Open Access Journals can be found on the original website.
A list of DOAJ corpora in Sketch Engine
- Directory of Open Access Journals (English) – 2.6 billion words
More languages will be available soon.
DOAJ corpora are POS tagged depending on language specifications.