-
Van den Heuvel, H., Oostdijk, N., Rowland, C. F., & Trilsbeek, P. (2022). The CLARIN Knowledge Centre for Atypical Communication Expertise. In D. Fišer, & A. Witt (Eds.), CLARIN: The Infrastructure for Language Resources (pp. 373-388). Berlin, Boston: De Gruyter.
Abstract
In this chapter we introduce the CLARIN Knowledge Centre for Atypical Communication Expertise. The mission of ACE is to support researchers engaged in languages which pose particular challenges for analysis; for this, we use the umbrella term “atypical communication”. This includes language use by second-language learners, people with language disorders or those suffering from language disabilities, and languages that pose unique challenges for analysis, such as sign languages and languages spoken in a multilingual context. The chapter presents details about the collaborations and outreach of the centre, the services offered, and a number of showcases for its activities.
-
Klamer, M., Trilsbeek, P., Hoogervorst, T., & Haskett, C. (2017). Creating a Language Archive of Insular South East Asia and West New Guinea. In J. Odijk, & A. Van Hessen (Eds.), CLARIN in the Low Countries (pp. 113-121). London: Ubiquity Press. doi:10.5334/bbi.10.
Abstract
The geographical region of Insular South East Asia and New Guinea is well-known as an
area of mega-biodiversity. Less well-known is the extreme linguistic diversity in this area:
over a quarter of the world’s 6,000 languages are spoken here. As small minority languages,
most of them will cease to be spoken in the coming few generations. The project described
here ensures the preservation of unique records of languages and the cultures encapsulated
by them in the region. The language resources were gathered by twenty linguists at,
or in collaboration with, Dutch universities over the last 40 years, and were compiled and
archived in collaboration with The Language Archive (TLA) at the Max Planck Institute in
Nijmegen. The resulting archive constitutes a collection of multimedia materials and written
documents from 48 languages in Insular South East Asia and West New Guinea. At TLA,
the data was archived according to state-of-the-art standards (TLA holds the Data Seal of
Approval): the component metadata infrastructure CMDI was used; all metadata categories
as well as relevant units of annotation were linked to the ISO data category registry ISOcat.
This guaranteed proper integration of the language resources into the CLARIN framework.
Through the archive, future speaker communities and researchers will be able to extensively
search the materials for answers to their own questions, even if they do not themselves know the language, and even if the language dies.
-
Wittenburg, P., & Trilsbeek, P. (2010). Digital archiving - a necessity in documentary linguistics. In G. Senft (Ed.), Endangered Austronesian and Australian Aboriginal languages: Essays on language documentation, archiving and revitalization (pp. 111-136). Canberra: Pacific Linguistics.
-
Wittenburg, P., Trilsbeek, P., & Lenkiewicz, P. (2010). Large multimedia archive for world languages. In SSCS'10 - Proceedings of the 2010 ACM Workshop on Searching Spontaneous Conversational Speech, Co-located with ACM Multimedia 2010 (pp. 53-56). New York: Association for Computing Machinery, Inc. (ACM). doi:10.1145/1878101.1878113.
Abstract
In this paper, we describe the core pillars of a large archive of language material recorded worldwide partly about languages that are highly endangered. The bases for the documentation of these languages are audio/video recordings which are then annotated at several linguistic layers. The digital age completely changed the requirements of long-term preservation and it is discussed how the archive met these new challenges. An extensive solution for data replication has been worked out to guarantee bit-stream preservation. Due to an immediate conversion of the incoming data to standards-based formats and checks at upload time lifecycle management of all 50 Terabyte of data is widely simplified. A suitable metadata framework not only allowing users to describe and discover resources, but also allowing them to organize their resources is enabling the management of this amount of resources very efficiently. Finally, it is the Language Archiving Technology software suite which allows users to create, manipulate, access and enrich all archived resources given that they have access permissions.