Icelandic Parsed Historical Corpus
Full metadata record:
Persistent identifier for the resource:
Contact Person: Rosén, Victoria
This resource is licensed under the following terms:
Lesser General Public License (LGPL)
BY SA
By accepting the terms of the license you will be granted access to the resource.
Attribution:
Wallenberg, Joel, Anton Karl Ingason, Einar Freyr Sigurðsson and Eiríkur Rögnvaldsson. 2011. Icelandic Parsed Historical Corpus (IcePaHC). Version 0.9. http://www.linguist.is/icelandic_treebank
Size: 73014 sentences, 1057182 words
Language(s): Icelandic (is)
Description:
About 1,000,000 words of Icelandic text, drawn from every century from the
12th to the 21st inclusive, annotated for phrase structure,
part-of-speech tagged, and lemmatized.

A copy of the treebank is searchable via the INESS portal. The original can be downloaded under an LGPL license; see the link elsewhere in the metadata.