CHILDES corpora are comparable corpora made up of transcripts of child language. Most of these transcripts record spontaneous conversational interactions, often between young monolingual children and their parents or siblings. The corpora also include transcripts of bilingual children, older school-aged children, adult second-language learners, children with various types of language disabilities, and aphasics who are trying to recover from language loss.

These corpora belong to the child language project within TalkBank, a system created for sharing and studying conversational interactions. The CHILDES corpora currently available in Sketch Engine cover 24 languages.

The CHILDES TalkBank web page is available at http://childes.talkbank.org/

Detailed information about each corpus within the CHILDES collection can be found at http://childes.talkbank.org/access/

Overview of CHILDES corpora in Sketch Engine

The following list of CHILDES corpora contains links to the corresponding corpus pages with detailed information. Each corpus page has a "Download transcripts" link to a zip archive containing a file called 0metadata.cdc, which stores all the metadata of that corpus.
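As a rough illustration of the step above, the metadata file can be read straight from a downloaded archive without unpacking it. The following is a minimal sketch using only the Python standard library. The function name read_corpus_metadata is hypothetical, and the sketch assumes only that the archive contains a member named 0metadata.cdc somewhere in its folder tree. It makes no assumption about the internal layout of that file and simply returns its text.

```python
import zipfile
from typing import Optional


def read_corpus_metadata(zip_source, name: str = "0metadata.cdc") -> Optional[str]:
    """Return the text of the first archive member whose file name equals `name`.

    `zip_source` may be a path to a downloaded zip archive or an open
    binary file-like object. Returns None if no such member exists.
    Hypothetical helper for illustration, not part of any CHILDES tooling.
    """
    with zipfile.ZipFile(zip_source) as archive:
        for member in archive.namelist():
            # Members may sit inside a folder, so compare the base name only.
            if member.rsplit("/", 1)[-1] == name:
                # Assumption: the file is text; undecodable bytes are replaced.
                return archive.read(member).decode("utf-8", errors="replace")
    return None
```

For example, calling read_corpus_metadata("Kovacevic.zip") on a locally saved archive (the file name here is only a placeholder) would return the metadata as a string, or None if the archive has no 0metadata.cdc file.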

CHILDES Afrikaans

More information can be found at http://childes.talkbank.org/access/Dutch/ (section Afrikaans)

CHILDES Catalan

More information can be found at http://childes.talkbank.org/access/Biling/Serra.html

CHILDES Croatian

More information can be found at http://childes.talkbank.org/access/Slavic/Croatian/Kovacevic.html

CHILDES Danish

  • http://childes.talkbank.org/access/German/
  • Klammler: http://childes.talkbank.org/access/Biling/Klammler.html
  • Koroschetz: http://childes.talkbank.org/access/Biling/Koroschetz.html

CHILDES English

  • http://childes.talkbank.org/access/German/
  • Klammler: http://childes.talkbank.org/access/Biling/Klammler.html
  • Koroschetz: http://childes.talkbank.org/access/Biling/Koroschetz.html

CHILDES Estonian

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Other/ (section Estonian)

CHILDES Farsi (Persian)

More information can be found at http://childes.talkbank.org/access/Other/Farsi/Family.html

CHILDES French

More information about particular collections can be found at:

  • Rondal: http://childes.talkbank.org/access/French/Rondal.html
  • VionColas: http://childes.talkbank.org/access/French/VionColas.html

CHILDES Gaelic (Irish)

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Celtic/Irish/Guilfoyle.html

CHILDES German

More information about particular collections of the corpus can be found at:

  • http://childes.talkbank.org/access/German/
  • Klammler collection: http://childes.talkbank.org/access/Biling/Klammler.html
  • Koroschetz collection: http://childes.talkbank.org/access/Biling/Koroschetz.html

CHILDES Hebrew

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Other/ (in the section Hebrew)

CHILDES Hungarian

More information about particular collections of the corpus can be found at

  • http://childes.talkbank.org/access/Other/ (in the section Hungarian)
  • http://childes.talkbank.org/access/narrative.html (MacBates collections)

CHILDES Italian

More information about particular collections can be found at:

  • http://childes.talkbank.org/access/Romance/
  • http://childes.talkbank.org/access/narrative.html

CHILDES Japanese

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Japanese/

CHILDES Korean

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/EastAsian/Korean/Jiwon.html

CHILDES Norwegian

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Scandinavian/Norwegian/Simonsen.html

CHILDES Polish

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Slavic/Polish/Szuman.html

CHILDES Portuguese

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Romance/ (section Portuguese)

CHILDES Russian

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Slavic/Russian/Protassova.html

CHILDES Spanish

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Spanish/

CHILDES Swedish

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/

CHILDES Tamil

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Other/Tamil/Narasimhan.html

CHILDES Thai

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/

CHILDES Turkish

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Frogs/ (search for Turkish)

Search the CHILDES corpora in Sketch Engine

Sketch Engine offers a range of tools to work with the CHILDES corpora.

Other text corpora in Sketch Engine

Sketch Engine offers 350+ language corpora.