The Digital Corpus of the European Parliament (DCEP) is a collection of documents published on the European Parliament’s official website. This parallel corpus contains texts in 23 languages.

For more information set the following websites:

File alignment statistics for all pairs


Sentence alignment statistics for all pairs


References – Relevant publications

For a more detailed description of DCEP and when making reference to DCEP in scientific publications, please refer to:

To compare DCEP with the other linguistic resources distributed by EU institutions, see:

To see how DCEP was added to Sketch Engine, see EUR-Lex.