CHILDES - child language corpus in many languages

CHILDES corpora are comparable corpora made up from transcripts of child language. Most of these transcripts record spontaneous conversational interactions. Often the speakers involved are young monolingual children conversing with their parents or siblings. Corpora also comprise transcripts of bilingual children, older school-aged children, adult second-language learners, children with various types of language disabilities and aphasics who are trying to recover from language loss.

These corpora belong to the child language project within the TalkBank system that was created for sharing and studying conversational interactions. Current CHILDES corpora in Sketch Engine include 24 languages.

CHILDES TalkBank web page is available at http://childes.talkbank.org/

Detailed information about each corpus within CHILDES collection can be found at http://childes.talkbank.org/access/

Availability

CHILDES corpora are accessible to users with a paid subscription, see our price list.

The overview of CHILDES corpora in Sketch Engine

The following list of CHILDES corpora contains link(s) to the particular corpus pages with detailed information. Each corpus page has the link Download transcripts to a zip archive containing a file called 0metadata.cdc where are stored all metadata of the particular corpus.

CHILDES Afrikaans

More information can be found at http://childes.talkbank.org/access/Dutch/ (section Afrikaans)

CHILDES Catalan

More information can be found at http://childes.talkbank.org/access/Biling/Serra.html

CHILDES Croatian

More information can be found at http://childes.talkbank.org/access/Slavic/Croatian/Kovacevic.html

CHILDES Danish

http://childes.talkbank.org/access/German/
Klammler: http://childes.talkbank.org/access/Biling/Klammler.html
Koroschetz: http://childes.talkbank.org/access/Biling/Koroschetz.html

CHILDES English

http://childes.talkbank.org/access/German/
Klammler: http://childes.talkbank.org/access/Biling/Klammler.html
Koroschetz: http://childes.talkbank.org/access/Biling/Koroschetz.html

CHILDES Estonian

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Other/ (section Estonian)

CHILDES Farsi (Persian)

More information can be found at http://childes.talkbank.org/access/Other/Farsi/Family.html

CHILDES French

More information about particular collections can be found at:

Rondall http://childes.talkbank.org/access/French/Rondal.html
Vioncolas http://childes.talkbank.org/access/French/VionColas.html

CHILDES Gaelic (Irish)

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Celtic/Irish/Guilfoyle.html

CHILDES German

More information about particular collections of the corpus can be found at

http://childes.talkbank.org/access/German/

and

Klammler collection: http://childes.talkbank.org/access/Biling/Klammler.html
Koroschetz collection: http://childes.talkbank.org/access/Biling/Koroschetz.html

CHILDES Hebrew

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Other/ (in the section Hebrew)

CHILDES Hungarian

More information about particular collections of the corpus can be found at

http://childes.talkbank.org/access/Other/ (in the section Hungarian)
http://childes.talkbank.org/access/narrative.html (MacBates collections)

CHILDES Italian

More information about particular collections can be found at:

http://childes.talkbank.org/access/Romance/
http://childes.talkbank.org/access/narrative.html

CHILDES Japanese

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Japanese/

CHILDES Korean

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/EastAsian/Korean/Jiwon.html

CHILDES Norwegian

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Scandinavian/Norwegian/Simonsen.html

CHILDES Polish

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Slavic/Polish/Szuman.html

CHILDES Portuguese

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Romance/ (section Portuguese)

CHILDES Russian

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Slavic/Russian/Protassova.html

CHILDES Spanish

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Spanish/

CHILDES Swedish

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/

CHILDES Tamil

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Other/Tamil/Narasimhan.html

CHILDES Thai

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/

CHILDES Turkish

More information about particular collections of the corpus can be found at http://childes.talkbank.org/access/Frogs/ (search Turkish)

Search the CHILDES corpora

Sketch Engine offers a range of tools to work with the CHILDES corpora.

Other text corpora

Sketch Engine offers 800+ language corpora.

available corpora

about Sketch Engine

Use Sketch Engine in minutes

Generate collocations, frequency lists, examples in contexts, n-grams or extract terms. Use our Quick Start Guide to learn it in minutes.

Quick Start Guide

Availability

The overview of CHILDES corpora in Sketch Engine

Search the CHILDES corpora

Other text corpora

Use Sketch Engine in minutes

for learners of languages

A Course in Lexicography and Lexical Computing

term extraction

learn sketch engine

CHILDES – child language corpus

Availability

The overview of CHILDES corpora in Sketch Engine

Search the CHILDES corpora

Other text corpora

Use Sketch Engine in minutes

for learners of languages

A Course in Lexicography and Lexical Computing

term extraction

learn sketch engine