Data curation for a VALID archive of Dutch language impairment data
The VALID Data Archive is an open multimedia data archive in which data from children and adults with language and/or communication problems are brought together. A pilot project, funded by CLARIN-NL, was carried out in which five existing data sets were curated. This pilot enabled us to build up experience in conserving different kinds of pathological language data in a searchable and persistent manner. These data sets reflect current research in language pathology rather well, both in the range of designs and the variety in pathological problems, such as Specific Language Impairment, deafness, dyslexia, and ADHD. In this paper, we present the VALID initiative, explain the curation process and discuss the materials of the data sets.
Publication typeJournal article