International Corpus of English (ICE)
Introduction | References | Corpus Structure | Corpus Information |
Document Information | Header Information | Metadata Overview |
Last update: 13-Sep-2000
The International Corpus of English began in 1990 with the primary aim of providing material for comparative studies of varieties of English throughout the world. Twenty centers around the world are preparing corpora of their own national or regional variety of English.
International Corpus of English Corpus Utility Programme (ICECUP). Information about metadata comes from the ICECUP online help manual (section: sociolinguistic variables).
The corpus is divided in several categories
(see http://www.ucl.ac.uk/english-usage/ice/design.htm)
Text code | The code number indicates the text category of the text within the hierarchy of the corpus as a whole |
Speakers per text | The number of speakers, including extra-corpus speakers, in dialogues |
Speakers Age | Age or age range of a speaker or author. The following ranges are used: 18-25, 26-45, 46-65, 66+ |
Speakers Gender | Speaker or author’s gender |
Speakers Education | Contains the speakers education (secondary, university). Secondary applies to those who have completed at most second-level schooling. University applies to those who have completed a course of tertiary education, though not necessarily at a University per se |
Speakers role | The communicative role of a speaker in an exchange, e.g. interviewer, chairman |
Medium | Indicates whether a broadcast was transmitted via TV or radio |
Scope | The scope of a newspaper, e.g. local, national |
Frequency | The frequency of publication of a newspaper, e.g. daily, weekly |
Circulation | An approximate circulation figure for a newspaper at the time of the publication of the text |