The SoNaR corpus is a 500-million-word reference corpus of contemporary written Dutch.

Access policy

To request access to this corpus, please contact the service desk of the Instituut voor Nederlandse Lexicologie (Institute for Dutch Lexicology) at servicedesk@inl.nl.


Related paper

Nelleke Oostdijk, Martin Reynaert, Véronique Hoste, Ineke Schuurman. The construction of a 500-million-word reference corpus of contemporary written Dutch. In Essential Speech and Language Technology for Dutch, pp. 219-247, 2013.