Corpus Gesproken Nederlands

(130595 corpus graphs)

1. General information

Name: Corpus Gesproken Nederlands
ID: CGN
Format: NeGra format, version 3

2. Corpus details

Features (T): word, pos, morph
Features (NT): cat
Labelled edges: yes
Crossing edges: yes
Secondary edges: yes

3. Statistical information

Number of corpus graphs: 130595
Number of tokens: 1142444
Average number of tokens: 8.7
Number of inner nodes: 625120
Number of edges: 1766873

4. Feature documentation

Feature values: cat

Feature values: pos

Feature values: morph

Edge labels

Secondary edge labels