The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcriptions of Norwegian parliamentary debates between 2008 and 2015, downloaded from https://data.stortinget.no/. Of the total set of documents, it is ongoing work to preprocess documents (e.g register previously unknown words in the document) and load the preprocessed documents into INESS for automatic parsing; hence, as of June 2016, the size of the treebank is still growing. To see the updated info on treebank size and which documents are included, please choose the relevant treebank, and then click "Treebank Details" (in the left-hand menu). Each sentence has the following metadata which is searchable in the INESS search system: (1) language variety - Norwegian bokmål (nob) or Norwegian nynorsk (nno), based on the automatic recognition of language variety, implemented by Paul Meurer at Uni Research Computing. There are also some transcriptions from speeches in English and Danish. (2) Speaker's name (3) Date and time (4) Political party to which the speaker belongs (5) Type of contribution (e.g. 'hovedinnlegg' [main contribution] or 'replikk' [reply]). AVAILABILITY: The material from 2008- 2015 is searchable via the corpus tool Corpuscle. Via the treebank portal INESS (clarino.uib.no/iness) you can search in sentence analyses from the material (for that set of documents that have currently been preprocessed and the automatically parsed). [less]
The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcr… [more]
The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcriptions of Norwegian parliamentary debates between 2008 and 2015, downloaded from https://data.stortinget.no/. Of the total set of documents, it is ongoing work to preprocess documents (e.g register previously unknown words in the document) and load the preprocessed documents into INESS for automatic parsing; hence, as of June 2016, the size of the treebank is still growing. To see the updated info on treebank size and which documents are included, please choose the relevant treebank, and then click "Treebank Details" (in the left-hand menu). Each sentence has the following metadata which is searchable in the INESS search system: (1) language variety - Norwegian bokmål (nob) or Norwegian nynorsk (nno), based on the automatic recognition of language variety, implemented by Paul Meurer at Uni Research Computing. There are also some transcriptions from speeches in English and Danish. (2) Speaker's name (3) Date and time (4) Political party to which the speaker belongs (5) Type of contribution (e.g. 'hovedinnlegg' [main contribution] or 'replikk' [reply]). AVAILABILITY: The material from 2008- 2015 is searchable via the corpus tool Corpuscle. Via the treebank portal INESS (clarino.uib.no/iness) you can search in sentence analyses from the material (for that set of documents that have currently been preprocessed and the automatically parsed). [less]
The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcr… [more]
The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcriptions of Norwegian parliamentary debates between 2008 and 2015, downloaded from https://data.stortinget.no/. Of the total set of documents, it is ongoing work to preprocess documents (e.g register previously unknown words in the document) and load the preprocessed documents into INESS for automatic parsing; hence, as of June 2016, the size of the treebank is still growing. To see the updated info on treebank size and which documents are included, please choose the relevant treebank, and then click "Treebank Details" (in the left-hand menu). Each sentence has the following metadata which is searchable in the INESS search system: (1) language variety - Norwegian bokmål (nob) or Norwegian nynorsk (nno), based on the automatic recognition of language variety, implemented by Paul Meurer at Uni Research Computing. There are also some transcriptions from speeches in English and Danish. (2) Speaker's name (3) Date and time (4) Political party to which the speaker belongs (5) Type of contribution (e.g. 'hovedinnlegg' [main contribution] or 'replikk' [reply]). AVAILABILITY: The material from 2008- 2015 is searchable via the corpus tool Corpuscle. Via the treebank portal INESS (clarino.uib.no/iness) you can search in sentence analyses from the material (for that set of documents that have currently been preprocessed and the automatically parsed). [less]
The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcr… [more]
The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcriptions of Norwegian parliamentary debates between 2008 and 2015, downloaded from https://data.stortinget.no/. Of the total set of documents, it is ongoing work to preprocess documents (e.g register previously unknown words in the document) and load the preprocessed documents into INESS for automatic parsing; hence, as of June 2016, the size of the treebank is still growing. To see the updated info on treebank size and which documents are included, please choose the relevant treebank, and then click "Treebank Details" (in the left-hand menu). Each sentence has the following metadata which is searchable in the INESS search system: (1) language variety - Norwegian bokmål (nob) or Norwegian nynorsk (nno), based on the automatic recognition of language variety, implemented by Paul Meurer at Uni Research Computing. There are also some transcriptions from speeches in English and Danish. (2) Speaker's name (3) Date and time (4) Political party to which the speaker belongs (5) Type of contribution (e.g. 'hovedinnlegg' [main contribution] or 'replikk' [reply]). AVAILABILITY: The material from 2008- 2015 is searchable via the corpus tool Corpuscle. Via the treebank portal INESS (clarino.uib.no/iness) you can search in sentence analyses from the material (for that set of documents that have currently been preprocessed and the automatically parsed). [less]
The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcr… [more]
The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcriptions of Norwegian parliamentary debates between 2008 and 2015, downloaded from https://data.stortinget.no/. Of the total set of documents, it is ongoing work to preprocess documents (e.g register previously unknown words in the document) and load the preprocessed documents into INESS for automatic parsing; hence, as of June 2016, the size of the treebank is still growing. To see the updated info on treebank size and which documents are included, please choose the relevant treebank, and then click "Treebank Details" (in the left-hand menu). Each sentence has the following metadata which is searchable in the INESS search system: (1) language variety - Norwegian bokmål (nob) or Norwegian nynorsk (nno), based on the automatic recognition of language variety, implemented by Paul Meurer at Uni Research Computing. There are also some transcriptions from speeches in English and Danish. (2) Speaker's name (3) Date and time (4) Political party to which the speaker belongs (5) Type of contribution (e.g. 'hovedinnlegg' [main contribution] or 'replikk' [reply]). AVAILABILITY: The material from 2008- 2015 is searchable via the corpus tool Corpuscle. Via the treebank portal INESS (clarino.uib.no/iness) you can search in sentence analyses from the material (for that set of documents that have currently been preprocessed and the automatically parsed). [less]
The treebank "Proceedings of Norwegian parliamentary debates (2008-2015)" is a collection of transcr… [more]
The "NorGramBank – Newspaper text (years 2012, 2013) in Norwegian Bokmål from the Norwegian Newspaper Corpus" treebank is a syntactically annotated corpus based on data taken from the years 2012 and 2013 from the Norwegian Newspaper Corpus (NCC). This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 246397 sentences, 3157558 words and 1543 documents. Note that the available treebank contains only those newspaper articles from 2012 and 2013 that have been manually preprocessed; see details otherwheres in the metadata. [less]
The "NorGramBank – Newspaper text (years 2012, 2013) in Norwegian Bokmål from the Norwegian Newspape… [more]
The "NorGramBank children’s fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 389564 sentences, 4111213 words and 155 documents. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank children’s fiction in Norwegian Bokmål" treebank is a syntactically annotated corpu… [more]
The "NorGram Non-fiction text in Norwegian Bokmål from Forskning.no" treebank is a syntactically annotated corpus based on data taken from the Norwegian popular science website Forskning.no. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 489341 sentences, 8321480 words and 13243 documents. [less]
The "NorGram Non-fiction text in Norwegian Bokmål from Forskning.no" treebank is a syntactically ann… [more]
The "NorGramBank Newspaper text in Norwegian Bokmål from the LBK" treebank is a syntactically annotated corpus based on data taken from the Norwegian reference corpus for Norwegian Bokmål, Leksikografisk Bokmålskorpus (LBK). This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 173914 sentences, 2661597 words and 599 documents. [less]
The "NorGramBank Newspaper text in Norwegian Bokmål from the LBK" treebank is a syntactically annota… [more]
The "NorGramBank non-fiction text in Norwegian Bokmål from the LBK" treebank is a syntactically annotated corpus based on data taken from the Norwegian reference corpus for Norwegian Bokmål, Leksikografisk Bokmålskorpus (LBK). This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 173914 sentences, 2661597 words and 599 documents. [less]
The "NorGramBank non-fiction text in Norwegian Bokmål from the LBK" treebank is a syntactically anno… [more]
The "NorGramBank television subtitles in Norwegian Bokmål from LBK" treebank is a syntactically annotated corpus based on data taken from the Norwegian reference corpus for Norwegian Bokmål, Leksikografisk Bokmålskorpus (LBK). This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 18043 sentences, 127844 words and 16 documents. [less]
The "NorGramBank television subtitles in Norwegian Bokmål from LBK" treebank is a syntactically anno… [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The treebank "NorGram NDT in LFG in Norwegian Bokmål (derivate from the Norwegian Dependency Treebank)" is based on the text material in the Norwegian Dependency Treebank (NDT), available from Språkbanken at National Library of Norway. The sentences have been parsed and disambiguated in the Norwegian LFG treebank using the NorGram LFG grammar. [less]
The treebank "NorGram NDT in LFG in Norwegian Bokmål (derivate from the Norwegian Dependency Treeban… [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on … [more]
The «Corona texts from NRK» treebank is a syntactically annotated corpus. It is based on data transcribed from the two newscasts Dagsrevyen and Supernytt produced by the Norwegian Broadcasting Corporation (NRK). [less]
The «Corona texts from NRK» treebank is a syntactically annotated corpus. It is based on data transc… [more]
The INESS Sofie Norwegian Treebank. The treebank is a syntactically annotated corpus based on the first chapters of the novel “Sofies verden” by Jostein Gaarder, published by Aschehoug forlag. The sentence-analyses are produced by INESS for the META-NORD project, whose goal was to promote the accessability of existing treebanks for the languages in the project. The corpus is automatically analyzed with the NorGram LFG grammar and all analyses are manually verified. [less]
The INESS Sofie Norwegian Treebank. The treebank is a syntactically annotated corpus based on the … [more]
The treebank "NorGrambank children's fiction in Norwegian Nynorsk" is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 106434 sentences, 1043260 words, 76 documents. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The treebank "NorGrambank children's fiction in Norwegian Nynorsk" is a syntactically annotated corp… [more]
The treebank "NorGram NDT in LFG in Norwegian Nynorsk (derivate from Norwegian Dependency Treebank)" is based on the text material in the Norwegian Dependency Treebank (NDT), available from Språkbanken at National Library of Norway. The sentences have been parsed and disambiguated in the Norwegian LFG treebank using the NorGram LFG grammar. [less]
The treebank "NorGram NDT in LFG in Norwegian Nynorsk (derivate from Norwegian Dependency Treebank)"… [more]
The treebank "NorGramBank annotations of Newspaper text from 'Nynorskkorpuset ved Norsk Ordbok 2014'" is a syntactically annotated corpus which uses text extracts from Nynorskkorpuset ved Norsk Ordbok 2014 (no2014.uio.no). This treebank is part of INESS NorGramBank collection (see URL in metadata). [less]
The treebank "NorGramBank annotations of Newspaper text from 'Nynorskkorpuset ved Norsk Ordbok 2014'… [more]
The treebank "Annotations of non-fiction text from 'Nynorskkorpuset ved Norsk Ordbok 2014'" is a syntactically annotated corpus which uses text extracts from Nynorskkorpuset ved Norsk Ordbok 2014 (no2014.uio.no). This treebank is part of INESS NorGramBank collection (see URL in metadata). [less]
The treebank "Annotations of non-fiction text from 'Nynorskkorpuset ved Norsk Ordbok 2014'" is a syn… [more]
The treebank "Annotations of fiction text from 'Nynorskkorpuset ved Norsk Ordbok 2014' is a syntactically annotated corpus which uses text extracts from Nynorskkorpuset ved Norsk Ordbok 2014 (no2014.uio.no). This treebank is part of INESS NorGramBank collection (see URL in metadata). [less]
The treebank "Annotations of fiction text from 'Nynorskkorpuset ved Norsk Ordbok 2014' is a syntacti… [more]
The "NorGramBank fiction in Norwegian Nynorsk" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 260285 sentences, 2884376 words and 91 documents. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing. [less]
The "NorGramBank fiction in Norwegian Nynorsk" treebank is a syntactically annotated corpus based on… [more]