Tatar sample corpus is ca 200 thousand words crawled from the web in the year 2015. The text in the corpus is tokenised.
Adam Kilgarriff Prize
Adam Kilgariff (1960-2015) was a British corpus linguist and founder of Lexical Computing, the company behind Sketch Engine. Adam devoted his whole life to research at the intersection of corpus linguistic, computational linguistics and lexicography.
To honour our brilliant and much-loved colleague, we established the Adam Kilgarriff Prize for outstanding work in the fields to which Adam contributed so much: corpus linguistics, computational linguistics, and lexicography.