You are here: Home Resources Data and corpora

Data and corpora


This page gives an overview of the data archives and language corpora which can be accessed through the MPI:


Browsable corpora at the MPI

Language corpora of data collected within the framework of MPI projects or data that has been collected in earlier times and now stored in the MPI archives. more >


The DoBeS (Documentation of Endangered Languages) program is financed by the German Volkswagen Stiftung. The aim of the program is twofold: (1) to document languages which are at the edge of disappearing and (2) to provide a persistent and long  lasting archive of the documentation material. Currently over 40 languages are being documented and archived. more >

Geographical Browsing of language sites

Use geographic browsing to explore a collection of places representing various research locations of the Max Planck Institute for Psycholinguistics and other Linguistic Archives. Download the Google Earth Overlay

GEoverlay image

The Corpus NGT

The NGT is a collection data from deaf signers using Sign Language of the Netherlands (NGT). Data consist of recordings with multiple synchronized video cameras, accompanied by gloss and translation annotations. All data are freely accessible to researchers and the general public. The project is carried out by Onno Crasborn, Inge Zwitserlood and Johan Ros from the Radboud University. The data is stored in the MPI archive for linguistic resources. more >

Database of Dutch diphone perception

This database is described in: Smits, R., Warner, N., McQueen, J.M. & Cutler (2003), Unfolding of phonetic information over time: A database of Dutch diphone perception, Journal of the Acoustical Society of America, 113, 563-574. to the database >

The Fromkin speech error database

The Fromkin Speech Error Database was collected over many years, and was converted to computer-readable form at UCLA with support from a National Science Foundation grant to Professor Victoria A. Fromkin.

At the time of Vicki Fromkin's death in January 2000, the wider availability of the database was in doubt because there was no longer support for the software format used to encode it. more >


The Stern diaries

Clara and William Stern kept a diary (Tagebuch) on the psychological development of their three children, Hilde, Günther and Eva, born in 1900, 1902 and 1904 respectively. 

Last checked 2015-04-07 by Paul Trilsbeek
About MPI

This is the MPI

The Max Planck Institute for Psycholinguistics is an institute of the German Max Planck Society. Our mission is to undertake basic research into the psychological,social and biological foundations of language. The goal is to understand how our minds and brains process language, how language interacts with other aspects of mind, and how we can learn languages of quite different types.

The institute is situated on the campus of the Radboud University. We participate in the Donders Institute for Brain, Cognition and Behaviour, and have particularly close ties to that institute's Centre for Cognitive Neuroimaging. We also participate in the Centre for Language Studies. A joint graduate school, the IMPRS in Language Sciences, links the Donders Institute, the CLS and the MPI.



Street address
Wundtlaan 1
6525 XD Nijmegen
The Netherlands

Mailing address
P.O. Box 310
6500 AH Nijmegen
The Netherlands

Phone:   +31-24-3521911
Fax:        +31-24-3521213

Public Outreach Officer
Marjolein Scherphuis