NorGrambank children's fiction in Norwegian Nynorsk
Full metadata record:
Persistent identifier for the resource:
Contact Person: Rosén, Victoria
This resource is licensed under the following terms:
CLARIN_ACA
BY ID NORED
Please click on the link to read the license terms.
You need to login via an approved identity provider (Feide, eduGAIN) to be able to access this resource.
If this is not possible for you, please register a CLARIN IdP account at https://user.clarin.eu/user.
If you do not have academic entitlement you will also have to apply for access on this page.
When you have registered your CLARIN IdP account, please log in using the CLARIN IdP link at the top of the page. Only then are we able to give you access to the corpus.
Attribution:
Please use the following text to cite this resource:
NorGrambank children's fiction in Norwegian Nynorsk. Created by Infrastructure for the Exploration of Syntax and Semantics. Distributed by the INESS Portal: hdl:11495/D963-33EA-65BD-0
Size: 106434 sentences, 1043260 words
Language(s): Norwegian (no), Norwegian Nynorsk (nn)
Description:
The treebank "NorGrambank children's fiction in Norwegian Nynorsk" is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of the INESS NorGramBank collection (see URL in metadata).
As of October 2015, the treebank comprises 106434 sentences and 1043260 words in 76 documents.
The source text was OCR-read by the National Library of Norway. Before syntactic parsing, INESS preprocessed the source text semi-automatically to correct OCR errors (misinterpreted letters, etc.).