This translation memory consists of 24 collections of texts, one for each of the following languages: Bulgarian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Irish, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Serbo-Croatian, Slovak, Slovenian, Spanish and Swedish.

The aligned texts come from DGT-TM, a large translation memory published by the European Commission.

The individual corpora have been processed by the latest processing tools available in Sketch Engine.


More details / Reference publication

For a more detailed description of DGT-TM, including further statistics on the resource, see the following publication. When referring to DGT-TM in scientific publications, please cite:

For a contrastive overview of DGT-TM and the other multilingual text resources offered for download on this site, you can read the following journal article:

DGT-TM is registered under the International Standard Language Resource Number (ISLRN) 710-653-952-884-4.