This translation memory consists of 24 collections of texts in Bulgarian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Irish, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Serbo-Croatian, Slovak, Slovenian, Spanish and Swedish language.
The aligned texts come from a large translation memory DGT published by The European Comission.
The individual corpora have been processed by the latest processing tools available in Sketch Engine.
More details / Reference publication
For a more detailed description of the DGT-TM, including more statistics on the resource, see the following publication. When making reference to DGT-TM in scientific publications, please refer to:
For a contrastive overview of DGT-TM and the other multilingual text resources offered for download on this site, you can read the following journal article:
DGT-TM has been registered with the International Standard Natural Language Resource number (ISLRN) 710-653-952-884-4.