Word Sketches definition files

The following files can be used for building word sketches in…

Setting up a learner corpus

How to set up a vertical file for a Learner Corpus. The errors…

Virtual Corpus

What is a virtual corpus? A virtual corpus is a corpus that…

The Sketch Engine Changelog

This is the changelog for the web user interface. You can also…

Command-line tools for generating and viewing n-grams

There are a number of utilities available in Finlib/Manatee that…

Text Types, Headers and Subcorpora

Overview: For many kinds of language study, text type is important.…

Preparing Corpus Text

The input format is "vertical" or "word-per-line (WPL)" text,…

Hebrew Translational Corpus

Also referred to as "Hebrew Comparable Corpus", uploaded in 2010. The…

CZES corpus

CZES is a Czech corpus consisting of newspaper articles and magazine…

Urdu

The web corpus containing 53 million words built with Corpus…

Turkic web corpora

There are the following Turkic language family corpora in Sketch…

TalkBank Persian

The TalkBank Persian corpus contains blog posts to various Farsi…

TED_en corpus

A corpus of transcripts of TED talks. Prepared by Akshay Min…