DoReCo version 2.0 was published on 12 December 2024! After two years of intense work, this update brings new datasets, substantial improvements to the consistency of annotations and metadata, and a myriad of smaller changes and bug fixes.
DoReCo 2.0 hosts annotated speech data from 53 low-resource and endangered languages from all inhabited continents, inviting cross-linguistic research into phonetics, phonology, and morphology.
DoReCo (Language Documentation Reference Corpus) is jointly edited by Frank Seifart, Ludger Paschen, and Matt Stave. The bulk of the update from v.1.2 to v.2.0 was developed within the AIRAL project at Leibniz-Centre General Linguistics (ZAS).
Check out the corpus website at https://doreco.huma-num.fr!AIRAL is an ongoing project (2022-2025) funded by the German Research Foundation (DFG). AIRAL (Acoustic Insights into the Root-Affix asymmetry across Languages) has the goal to shed light on the acoustic properties of roots and affixes in a world-wide sample of 40 languages. The project draws upon a combination of morphological and phonetic time alignments provided by the DoReCo corpus, which allow to study the effects of morphological structure on fine phonetic detail (duration, spectral properties) across languages with vastly different sound inventories and levels of morphological synthesis.
As of December 2024, the main outcomes of AIRAL are:
The AIRAL team consists of:
Collaborators:
Former project members: