The root element for IMDI descriptions
Instantiation of a VocabularyDef_Type
Revision history of the metadata description
Information on creation location for this data
The name of a continent
The name of a country
The name of a geographic region
The address
List of a number of key name value pairs. Should be used to add information that is not covered by other metadata elements at this level
Groups information about the languages used in the session
Description for the list of languages spoken by this participant
Groups information about access rights for this data
Availability of the data
Date when access rights were evaluated
Name of owner resource
Publisher responsible for distribution of this data
Resource is preferably a metadata resource. In the case of a well-defined merged metadata/content format such as TEI or legacy resources for which no further metadata is available it is the resource itself. If the external resource is an IMDI session with written resources Type & SubType will be the same as the Type & SubType of the primary written resource in that session. If it is a session with IMDI multi-media resources the Type of the Media
File will designate it. SubType is used only for written resources. Non-IMDI metadata resource types need to be mapped to IMDI types
The type of the external (metadata) resource
The sub type of the external (metadata) resource. Only used in case its metadata for a written resource
The metadata format
The URL of the external metadata record
Project Information
A short name or abbreviation for the project
The full title of the project
A unique identifier for the project
Contact information for this project
Description for this project
Type for group of metadata pertaining to a session
Groups information about the location where the session was created
Groups information about the project for which the session was (originally) created
Project keys
Groups information about the content of the session. The content description takes place in several (overlapping) dimensions
Groups information about all actors in the session
Major genre classification
Sub genre classification
List of he major tasks carried out in the session
List of modalities used in the session
Classifies the subject of the session. Uses preferably an existing library classification scheme such as LCSH. The element has a scheme attribute that indicates what scheme is used. Comments: The element can be repeated but the user should guarantee consistency
This groups information concerning the context of communication
degree of interactivity
Degree of planning of the event
Indicates in how far the researcher was involved in the linguistic event
Indicates the social context the event took place in
Indicates the structure of the communication event
Indicates the channel of the communication
Description for the content of this session
Description about the actors as a group
Group of actors
Functional role of the actor e.g. consultant, contributor, interviewer, researcher, publisher, collector, translator
Name of the actor as used by others in the transcription
Official name of the actor
Short unique code to identify the actor as used in the transcription
The family social role of the actor
The actor languages
The ethnic groups of the actor
The age of the actor
The birthdate of the actor
The sex of the actor
The education of the actor
Indicates if real names or anonymized codes are used to identify the actor
Contact information of the actor
Actor keys
Description for this individual actor
Type for a corpus that points to either other corpora or sessions
Name of the (sub-)corpus
Title for the (sub-)corpus
Description of the (sub-)corpus
Link to other resource. Attribute name is for the benefit of browsing
Type for group metadata pertaining to published corpora
Name of the published corpus
Title of the published corpus
Identifier of the published corpus
Description of the published corpus
The languages used for documentation of the corpus
Description for the list of languages
The languages in the corpus that are subject of analysis
Description for the list of languages
Content type of the published corpus
Publisher responsible for distribution of the published corpus
Authors for the resources
Human readabusle string that indicates total size of corpus
Pricing info of the corpus
Person to be contacted about the resource
URL to the resource
URL to the metadata for the resource
List of any publications related to the resource
Groups information of language resources connected to the session
Groups all media resources
Groups information about a Written Resource
Groups information only pertaining to a Lexical resource
Groups information only pertaining to a lexiconComponent
Groups information about the source; e.g. media-carrier, book, newspaper archive etc.
Groups data about name conversions for persons who are anonymised
Groups information about external documentation associated with this session
Every description is a reference
Groups information about the media file
URL to media file
Major part of mime-type
Minor part of mime-type
Size of media file
Quality of the recording
describes technical conditions of recording
Groups information about a Written Resource
URL to file containing the annotations/transcription
URL to media file from which the annotations/transcriptions originate
Date when Written Resource was created
The type of the WrittenResource
The subtype of the WrittenResource
File format used for Written Resource
The size of the Written Resource file. Integer value with addition of M (mega) or K (kilo)
How this document relates to another resource
Character encoding used in the written resource
Content encoding used in the written resource
Language used in the resource
Indicates if data has been anonymised. CV boolean
Groups information only pertaining to a Lexical resource
URL to lexical resource
Date when lexical resource was created
The type of the WrittenResource
The format of the LexicalResource
The character encoding of the LexicalResource
The size of the LexicalResource in bytes
The number of head entries of the LexicalResource
The number of sub entries of the LexicalResource
OCV: Sentence, Phrase, Wordform, Lemma, ...
OCV: HyphenatedSpelling, SyllabifiedSpelling, ...
OCV: Stem,StemALlomorphy, Segmentation, ...
OCV: POS, Inflexion, Countability, ...
OCV: Complementation, Alternation, Modification, ...
OCV: Transcription, IPA Transcription, CV pattern, ...
OCV: Sense dstinction
A block to describe the languages that are used to define terms, to describe meaning
Groups information only pertaining to a lexiconComponent
URL to lexiconComponent
Date when lexiconComponent was created
The type of the lexiconComponent
The format of the lexiconComponent
The character encoding of the lexiconComponent
The size of the lexiconComponent in bytes
Describes the tree in which the component can be embedded
Describes the possible parents of the lexiconComponent in the schema tree
Descibes the preferred parent of the lexiconComponent in the schema tree
Describes the possible component children of the lexiconComponent in the schema tree
Describes the possible category children of the lexiconComponent in the schema tree
Gives information on the lexical applications of the lexiconComponent
Describes whether the lexiconComponent can be used to add orthography to the lexicon schema
Describes whether the lexiconComponent can be used to add morphology to the lexicon schema.
Describes whether the lexiconComponent can be used to add morphosyntactic features to the lexicon schema
Describes whether the lexiconComponent can be used to add syntactic features to the lexicon schema
Describes whether the lexiconComponent can be used to add phonology to the lexicon schema.
Describes whether the lexiconComponent can be used to add a semantic element to the lexicon schema
A block to describe the languages that are used to define terms, to describe meaning
Groups information about the original source; e.g. media-carrier, book, newspaper archive etc.
Unique code to identify the original source
Physical storage format of the source
Quality of original recording
Description for the original source
Groups data about name conversions for persons who are anonymised
URL to information to convert pseudo named to real-names
The definition of a vocabulary. Attributes: Date of creattion, Link to origin. Contails a Description be element to descr+++ ibe the domain of the vocabulary and a (unspecified) number of value enries
Human readable description in the form of a text with language id specification and/or a link to a file with a description and language id specification. The name attribute is to name the link (if present)
Contact information for this data
The validation used for the resource
CV: content, type, manual, automatic, semi-automatic
Validation methodology
Percentage of resource validated
Specifies age of a person with differerent counting methods
Specifies age of a person in the form of a range
An element from a set of languages used in the session
Unique code to identify a language
Name of the language
Is it the speakers mother tongue. Only applicable if used in the context of a speakers language
Is it the speakers primary language. Only applicable if used in the context of a speakers language
Is it the most frequently used language in the document. Only applicable if used in the context of the resource's language
Direction of translation. Only applicable in case it is the context of a lexicon resource
Direction of translation. Only applicable in case it is the context of a lexicon resource
Description for this particular language
Indicates if language is dominant language
Indicates if language is source language
Indicates if language is target language
Description of the language
Information on language name and id
Unique code to identify a language
The name of the language
String type for single spaced, single line strings
Comma separated string
The age of a person
The age of a person given as a range
The age counting method
Vocabulary content and attributes
Link to a vocabulary definition
Position (start (+end) ) on a old fashioned tape without time indication
Position in a media file or modern tape
The start time position of a recording
The end time position of a recording
Quality indication
Unspecified is a non-existing (null) value. Unknown is a informational value indicating that the real value is not known
empty string definition
Comma seperated string
Loose boolean value where empty values are allowed
integer + Unspecified and Unknown
Defines a date that can also be empty or Unknown or Unspecified
Defines a date range that can also be Unspecified or Unknown
Language identifiers
Time position in the hh:mm:ss:ff format
Quality values (1 .. 5) also allows empty values
All possible vocabulary type values
Allowed values for metadata transcripts
Attributes allowed for profiles