2. Proceedings
Currently, we can identify a number of trends in the community dealing with multimodal/multimedia language resources:
The number of resources is increasing rapidly;
Multimedia extensions and rich annotation are making the resources more complex;
Multimedia extensions are making the resources much larger, requiring new modes of storage.
These resources will become accessible via the Internet.
The joint EC/NSF funded EAGLES/ISLE1 initiative - as discussed in the white
paper - aims to create standards and guidelines
that can be applied to natural interactivity and multimodal language resources
(e.g. speech, gesture, facial expressions, sign languages) that support the
creation, use, re-use of and access to such resources. As part of this
initiative, the workshop will address current trends and discuss structures
which could simplify and assist the creation and use of annotated
multimodal/multimedia resources, the process of finding suitable resources, and
accessing them, for instance, via the Web. The workshop will address three related
areas: annotation schemas, meta-descriptions for multimodal/multimedia
language resources and tools and annotation environments.
It is time to parallel the metadata activities occurring in the various groups by bringing the far-flung community of multimedia language resource users together to start a discussion about the meta schemas to describe their resources. The goal is to be able to add linked meta-descriptions to the available multimedia language resources to form a browsable and searchable universe open to the Internet. A known portal, standardised meta-descriptions and suitable tools would make it easier for users to find the right resources for the task at hand. This interest unifies people from science, industry, and the wider community who have to use annotated multimedia resources for scientific analysis, training or commercial applications.
Part of the proposed workshop will be dedicated to discussing the need for such a universe of linked meta-descriptions, the scope of the community, and existing work in this area. The nature of the meta-descriptions has to be discussed in detail with an emphasis on questions such as: (1) Which are the elements which describe the various language resources? (2) Is a minimal schema preferred or do we need something extensible (3) How can we achieve flexibility within the standard meta-description? (4) How can we automatically derive meta-descriptions to make general annotation feasible?
The workshop will also discuss whether benefits can be taken from existing standards such as Dublin-Core from the community of digital libraries, whether initiatives in the telecommunication and broadcasting community like the the W3C Resource Description Framework are of relevance for our goals.
A second session of the workshop will be dedicated to discussing annotation schemas for multimodal/multimedia language resources. Until now the community has largely worked with text-only corpora based on orthographical transcriptions (with all their limitations) and with corpora covering speech data typically associated with one layer of orthographic transcription specifically tailored to the needs of Automatic Speech Recognition systems. As computers have become more powerful, people have started to build corpora based on several video and sound tracks with rich multi-layer - up to 50 and more - annotation. The layers of annotation can have complex time relationships and intricate dependencies between and within layers. It seems clear that many such complex structured corpora will be created and that the community needs guidelines to restrict the heterogeneity of such corpora.
At the Granada LREC conference we heard about initial projects having implemented "Abstract Data Models" for such multimedia corpora. In the meantime a broad discussion about the underlying universal structure for such annotations has also been initiated. A number of projects in the US and Europe have been funded to develop annotation and exploitation tools to cope with complex multimedia databases. A specialised workshop dedicated to annotation schemas is now over-due if we want to get good interoperability between resources and unified access to resources. Without an agreed standard for for annotation schemas we risk an explosive proliferation of the access tools needed to exploit such databases.
The emergence of multimedia on computers makes it possible to supercede the traditional approach, because direct media access allows us to refer to media time which will never change instead of referring to transcriptions which can be modified and are often inadequate for coding complex time relationships.
The session will not only address theoretical matters such as the underlying common structure and abstract data models, but will also discuss suitable representation formats important for implementation. Formats suitable for open exchange and long-term archiving will not be the optimal choice for all types of program access and vice versa. We expect that modern tools will have to handle several co-existing representation formats. We also have to deal with the question of how we can integrate existing text-based corpora, or corpora which are progressively annotated after collection
There are a number of annotation schemes in general use, and there is a requirement for tools that can handle assemblies of language resources where individual resources don't all use the same scheme. Access to large corpora becomes impractically cumbersome without specialised tools and data structures. The session will report on current activities in this area.
Approaches to the fundamental design of the formats, data, and tools are varied among current systems for the annotation and exploitation of linguistic corpora. A primary reason for this diversity is that most developers are concerned with only one aspect of the creation/annotation/exploitation process. However, in order to work effectively toward commonality, the phases of the process must be considered as a whole. This demands bringing together researchers and developers from a variety of domains in text, speech, video, etc., many of whom have previously had little or no contact.
This workshop is intended to bring these groups together to look broadly at the technical issues that bear on the development of software systems for the annotation and exploitation of linguistic resources. The goal is to lay the groundwork for the definition of a data and system architecture to support corpus annotation and exploitation that can be widely adopted within the community. Among the issues to be addressed are:
A "Birds of a Feather" session for those interested in the ANC project will be held immediately following the workshop.
The workshop organizers have produced proceedings. We have received final versions of most of the papers accepted for the workshop, and these were sent to the printers on May18th. These are available on the program page.
H. Cunningham, Department of Computer Science, University Sheffield
D. Roy, Natural Interactive Systems Laboratory, Faculty of Science and Engineering, University of Southern Denmark Odense
P. Wittenburg, Technical Department, Max-Planck-Institute for Psycholinguistics, Nijmegen
N.O. Bernsen (U Odense)
S. Bird (U Penn)
P. Bonhomme (LORIA Nancy)
D. Broeder (MPI Nijmegen)
H. Brugman (MPI Nijmegen)
L. Burnard (U Oxford)
N. Calzolari (ILC Pisa)
K. Choukri (ELRA Paris)
B. Comrie (MPI Leipzig)
H. Cunningham (U Sheffield)
T. Hanke (U Hamburg)
U. Heid (U Stuttgart)
N. Ide (Vassar College)
J. Kegl (U Southern Maine)
T. McEnery (U Lancaster)
D. McNeill (U Chicago)
B. MacWhinney (CMU Pitsburgh)
J. Mariani (LIMSI Paris)
J.C. Martin (LIMSI Paris)
L. Noldus (Noldus Wageningen)
S. Piperides (ILSP Athens)
W. Peters (U Sheffield)
R. Posner (TU Berlin)
L. Romary (LORIA Nancy)
A. Russel (MPI Nijmegen)
D. Roy (U Odense)
D. Slobin (U Berkeley)
S. Steininger (U München)
S. Stromqvist (U Lund)
H. Thompson (HCRC Edinburgh)
M. Turk (Microsoft)
Y. Wilks (U Sheffield)
P. Wittenburg (MPI Nijmegen)
A. Zampolli (ILC Pisa)
PART 1:
Meta-Descriptions for Multimodal/Multimedia Language
Resources
This table lists times, speakers and titles; click on
the speaker name to get the abstract;
this usually gives all the authors and their affiliations.
Times for coffee breaks may change.
14:30 | 14:50 | Wittenburg | Meta-Descriptions for Language Resources |
14:50 | 15:20 | Thompson | All Data is Meta-Data: Rich Architectures for Rich Resources (link to slides). |
15:20 | 15:40 | Heid | Querying Meta and Object Data - Problems and Elements of Solutions |
15:40 | 16:00 | Stromqvist | Optional extensions - a proposal for a flexible annotation system |
16:00 | 16:20 | Oostdijk | Meta-Data in the Dutch Spoken Corpus Project |
16:20 | 16:40 | Coffee Break | . |
16:40 | 17:00 | Suihkonen | On Meta Descriptions for Cross-Linguistic Electronic Linguistic Data |
17:00 | 17:20 | Broeder | A Browseable Corpus: Accessing linguistic resources the easy way |
17:20 | 17:40 | Choukri | Meta-Data from ELRA Perspective |
17:40 | 18:00 | Discussion + Summary | . |
PART 2:
Annotation Schemes for Multimodal/Multimedia Language
Resources
9:00 | 9:20 | Wittenburg | Terminology for Annotation Schemes |
9:20 | 9:40 | Martin | Types of Cooperation and Referenceable Objects |
9:40 | 10:00 | Steininger | Transliteration of Language and Labeling of Emotion and Gestures in SMARTKOM |
10:00 | 10:20 | Villasenor | A Multimodal Dialogue Contribution Coding Scheme |
10:20 | 10:40 | Salmon-Alt | Increasing the Genericity of the MATE Annotation Framework |
10:40 | 11:00 | Delmonte | Towards an annotated Database for Anaphora Resolution |
11:00 | 11:30 | Coffee Break | . |
11:30 | 11:50 | Brugman | The EUDICO project, multi-media annotation over the Internet |
11:50 | 12:10 | Ghorbel | Semi-Automatic Annotation of Multimedia Documents via Adaptive Interfaces |
12:10 | 12:30 | Vollmann | Annotation of Sound(/video) data in the Multimedia Language Documentation and Language Research Laboratory |
PART 3:
Data Architectures and Software Support for Large Corpora
14:30 | 14:50 | Ide | Requirements, Tools and Architectures for Annotated Corpora |
14:50 | 15:10 | Bird | ATLAS: A flexible and extensible Architecture for Linguistic Annotation |
15:10 | 15:30 | Simons | Cellar: A data modeling system for linguistic annotation |
15:30 | 15:50 | Folch | Semantic Tagging of a Corpus using the Topic Navigation Map Standard |
15:50 | 16:10 | Romary | A Framework for multi-level linguistic annotation |
16:10 | 16:30 | Coffee Break | . |
16:30 | 16:50 | Ide | The XML Framework and Its Implications for Corpus Access and Use |
16:50 | 17:10 | Dybkjaer | The MATE Workbench |
17:10 | 17:30 | Fafiotte | A simulation and collection platform on the Internet for multi-modal translated spoken dialogues |
17:30 | 18:30 | Ide&Wittenburg | Summarizing Panel and Discussion Subject are all three parts of the EAGLES/ISLE workshops |
1. International Standard in Language Engineering project funded by EC and NSF. (back to reference)