Entries by Ondřej Matuška

English Preposition Corpus

Our new unique English Preposition Corpus uncovers how prepositions behave and what senses they have. The corpus features special annotation for the sense of the preposition and also for the semantic class of the word that precedes and follows the preposition. The user can

Spanish: NEW rich collocations and NEW clitics handling

Our new Spanish Word Skteches give a much better coverage of Spanish-specific phenomena such as compound verb tenses, verb constructions, ser/estar or el subjuntivo. Spanish collocation information has never been so rich. decirnos, descargárselo, comerselo are examples of verbs with clitics which pose a problem when searching. Sketch Engine can now handle these much better, searching for decir […]

New Chinese Word Sketches

The Word Sketches for Simplified Chinese have been completely rewritten and provide a much more complete information about Chinese collocations.

Complex concordance searches are now easier

Complex concordance searches are now easier in Sketch Engine thanks to the new CQL manual. The content has been completely rewritten and reorganized into logical sections.

N’ko corpus

The World Organization for the Development of N’ko has decided to host their N’ko corpus with Sketch Engine. The corpus is hosted as an open corpus and is freely accessible even without a Sketch Engine account.

Sketch Engine workshop at Europhras 2017

Meet Sketch Engine at Europhras 2017 Sketch Engine will be attending Europhras 2017 in London to give talks and also a workshop where you can learn directly from the Sketch Engine experts about all the ins and outs of the system.

Jozef Stefan Institute Timestamped Corpus

Diachronic corpus of English A unique diachronic corpus of English newsfeeds, the Jozef Stefan Institute Timestamped web Corpus, has been added to Sketch Engine. It can also serve as an excellent contemporary web corpus.

XLIFF support for multilingual files

XLIFF support in Sketch Engine Sketch Engine users can now create their user corpora from texts in the XLIFF file format, a format used during localization processes and a standard for CAT tools.

languages of user corpora infographics

Do many Sketch Engine users create their own corpora? How popular is it to create your own corpus in Sketch Engine? And which languages are most popular among Sketch Engine users? Find out from this infographics.

Upgrade your lexicography and lexical computing skills

Your 5 days to get up-to-date with the latest developments in corpus-driven lexicography and to activate and enhance your corpus query skills with some of the top experts in the field. Location: Leiden, Netherlands Dates: 12 – 16 September 2017

Multiword word sketches

The Word Sketch input form now supports multiword expressions. The resulting word sketch will treat the multiword expression as one unit and display its collocations.

Adam Kilgarriff Prize: announcement of winner

On behalf of the Trustees of the Adam Kilgarriff Prize, I am delighted to announce that the inaugural Prize has been awarded to Pawel Rutkowski of the University of Warsaw. Dr. Rutkowski heads up a team which has developed (and continues to develop) a large annotated corpus of Polish sign language and the fully corpus-based Dictionary of Polish […]