History: Corpora and corpus tools at the Max Planck Institute

EUDICO is the culmination of years of experience with corpora and software tools operating on those corpora. Existing corpora and corresponding tools from outside the institute were used extensively. Several corpora and toolsets were constructed in-house.

Most used external corpora and tools:

In-house corpora, corpus tools and other linguistic resources:

The development of EUDICO started in late 1997 with a pilot to check the feasibility of realizing our concepts on the basis of Java and the Java Media Framework (JMF). The focus of this pilot was the possibility of synchronizing text presentation with playback of MPEG-1 video. After successful completion of the pilot, the actual project started at the beginning of 1998, resulting in the public availability of a read-only demonstration version in the summer of 1999. In this version all initial project aims, except support for distributed editing, were successfully realized.

Our goal for 2000 is to have a full-fledged linguistic toolbox for work on multi-modal corpora.

During the first half of 1999, a pilot project was successfully concluded that examined whether the EUDICO system could sensibly be merged with components from Sheffield University's GATE system.