DoReCo

DoReCo version 2.0 was published on 12 December 2024! After two years of intense work, this update brings new datasets, substantial improvements to the consistency of annotations and metadata, and a myriad of smaller changes and bug fixes.

DoReCo 2.0 hosts annotated speech data from 53 low-resource and endangered languages from all inhabited continents, inviting cross-linguistic research into phonetics, phonology, and morphology.

DoReCo (Language Documentation Reference Corpus) is jointly edited by Frank Seifart, Ludger Paschen, and Matt Stave. The bulk of the update from v.1.2 to v.2.0 was developed within the AIRAL project at the Leibniz-Centre General Linguistics (ZAS).

Check out the corpus website at https://doreco.huma-num.fr!



AIRAL

AIRAL (Acoustic Insights into the Root-Affix asymmetry across Languages) is an ongoing project (2022-2025) funded by the German Research Foundation (DFG). It aims to shed light on the acoustic properties of roots and affixes in a worldwide sample of 40 languages. The project draws on the combined morphological and phonetic time alignments provided by the DoReCo corpus, which make it possible to study the effects of morphological structure on fine phonetic detail (duration, spectral properties) across languages with vastly different sound inventories and levels of morphological synthesis.

As of December 2024, the main outcomes of AIRAL are:

  1. Publication of DoReCo 2.0 in December 2024
  2. Study on fine differences in acoustic duration between homophonous morphs (currently under review)
  3. Study on wordhood and prosodic detachability of affixes (currently under review)
  4. Study on cross-linguistic differences and commonalities in rhythm (currently under review)
  5. Handbook article on phonetic corpora (currently under review)
  6. Study on word-initial consonants, published in Nature Human Behaviour in September 2024



The AIRAL team consists of:

Collaborators:

Former project members: