NorGramBank children’s fiction in Norwegian Bokmål
Full metadata record:
Persistent identifier for the resource:
Contact Person: Rosén, Victoria
This resource is licensed under the following terms:
CLARIN_ACA
BY ID NORED
Please click on the link to read the license terms.
You need to log in via an approved identity provider (Feide, eduGAIN) to be able to access this resource.
If this is not possible for you, please register a CLARIN IdP account at https://user.clarin.eu/user.
If you do not have academic entitlement you will also have to apply for access on this page.
When you have registered your CLARIN IdP account, please log in using the CLARIN IdP link at the top of the page. Only then are we able to give you access to the corpus.
Attribution:
Please use the following text to cite this resource:
NorGramBank children’s fiction in Norwegian Bokmål. Created by Infrastructure for the Exploration of Syntax and Semantics. Distributed by the INESS Portal: hdl:11495/D988-1F83-B1F5-1
Size: 389564 sentences, 4111212 words
Language(s): Norwegian (no), Norwegian Bokmål (nb)
Description:
The "NorGramBank children’s fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of the INESS NorGramBank collection (see URL in metadata).

As of October 2015, the treebank comprises 389564 sentences, 4111213 words and 155 documents.
The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically to correct OCR errors (misinterpreted letters, etc.) before syntactic parsing.