International Workshop Towards a Research Infrastructure for Language Resources
!!! Changed deadline for abstract submissions: February 17th, 2006 !!!
Many teams are working hard to establish a sound framework for eHumanities
in which language resources play a fundamental and enabling role, both with
language as an object of research and with language as a carrier of meaning.
The future researcher wants to interact with an integrated and interoperable
domain of language resources that is persistent, accessible and extendable.
Here, language resources are meant in the more general sense, i.e.
they cover data resources (texts of different sorts, annotated multimedia
recordings, lexica, grammars, geographical databases etc.), tools (aligners,
annotators, parsers, taggers, meaning extractors etc.) and knowledge
sources (metadata, data category registries, relation registries and
ontologies). Only a solid and sustainable research infrastructure
breaking national boundaries will help us to realize the researcher’s
dream. Persistence will be of crucial importance, since researchers
will only invest time if they see potential benefits.
All are built
upon strong international backbone network infrastructures, emerging
Grid middleware and common standards and frameworks such as XML, RDF
and web services. In addition we can refer to national formation processes
that will form the pillars for a sustainable international research
infrastructure. In Europe for example we can refer to AHDS (UK), DANS
(NL) and CNRS-eScience (FR) as examples for national centers for the
Language Resource Centers
Language resource centers are the key pillars for such research infrastructures.
They can be digital archives that, by their nature, should be based
on principles and technologies that enable digital sustainability,
such as: (1) Web-accessible metadata standards for resource management
and cataloguing; (2) Separation of the mutable physical structure from
the logical one relevant for researchers; (3) Preservation of bit-stream
representations by regular migration to new technology and by distributing
them; (4) Facilities to allow interested and qualified researchers
to add new data or upload new versions of existing data; (5) Easy
and flexible user access to the resources; and (6) Utilization frameworks
that take into account the heterogeneity of the resources in terms
of linguistic data types, structural differences and differences in
linguistic terminology. But there can be other centers that maintain
registries of useful components, schemas and tools.
In addition to addressing what the organizational pillars of research infrastructures might be and what exactly constitutes a federation of language resource centers and archives, the workshop will discuss and share information about technologies that can help in setting up and managing large research infrastructures for language resources. All important technologies currently being tested in European or international projects should be critically discussed to understand their potential and state of maturity. Some time will be devoted to roadmap issues.
 DELAMAN = Digital Endangered Languages and Music Archives Network
 ISO TC37/SC4 Management of Language Resources
We invite the submission of papers that have the potential to contribute to building a stable and sustainable research infrastructure for language resources, with digital archives as one of its key pillars. A number of keywords may indicate the scope of the workshop:
We encourage contributors to submit summarizing papers that compare existing technologies and organizational frameworks on the basis of project experience, and that evaluate their suitability for stable and sustainable research infrastructures and archive federations in the language resource domain as part of future eHumanities.
Extended abstracts of about 1000 words should be submitted as
Word, PDF or ASCII documents by email to the following address: “email@example.com”.
Final papers should not exceed 4 pages.
Magazzini del Cotone Conference Center, Genoa, Italy
Questions about the workshop can be addressed to: firstname.lastname@example.org