ruSkELL: Russian Corpus for SkELL
Russian Corpus for SkELL is a text corpus specially built up for Rusian SkELL interface (ruSkELL) available at http://ruskell.sketchengine.co.uk/run.cgi/skell. The corpus does not contain whole documents but only sentences sorted according to their text quality. This score was computed by the GDEX system.
This corpus is consisted of texts (99.8 %) come from the Russian top level domain .ru, the most frequent web domains are kontrolnaja.ru, news.yandex.ru, alterauto.ru, pressarchive.ru and com.sibpress.ru covering just 0.09 % off all corpus documents.
These sources provide a good example of how Russian is used in everyday, standard, formal and professional context almost 1 billion words in more than 68 million sentences.