The ParGram collection is a set of parallel treebanks covering a chosen set of syntactic constructions. It is a collaborative effort of the ParGram project, along with the ParSem project, by research groups in industrial and academic institutions around the world. The aim of ParGram is to produce wide-coverage grammars for a variety of languages. These are written collaboratively within the linguistic framework of LFG (Lexical Functional Grammar) and with a commonly agreed-upon set of grammatical features. The XLE (Xerox Linguistic Environment) is used as the development platform. ParSem develops semantic structures based on the ParGram syntactic structures. Most of the ParSem systems use the XLE’s XFR system. Semiannual meetings are held to bring together the various research groups involved in ParGram and ParSem.
The ParTMA collection is a collaborative effort by research groups in academic institutions around the world. The aim of ParTMA is to produce parallel treebanks that cover constructions relevant to the semantics of Tense, Mode and Aspect. The treebank sentences are analyzed with the grammars of the ParGram project.
The “Universal Dependencies 2.0” collection is searchable at the INESS portal. For further details about the original collection as a whole, or about the individual treebanks in it, see the original, which is located at the LINDAT/CLARIN Centre for Language Research Infrastructure (http://hdl.handle.net/11234/1-1983). The individual treebanks have individual licenses, which are available through the joint license “Universal Dependencies v2.0 License Agreement”. Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
The “Universal Dependencies 2.1” collection is searchable at the INESS portal. For further details about the original collection as a whole, or about the individual treebanks in it, see the original, which is located at the LINDAT/CLARIN Centre for Language Research Infrastructure (http://hdl.handle.net/11234/1-2515). The individual treebanks have individual licenses, which are available through the joint license “Universal Dependencies v2.1 License Agreement”. Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
The "Universal Dependencies 1.1 - Hungarian" treebank is part of the Universal Dependencies 1.1 collection, which is searchable at the INESS portal. For further details about the original collection as a whole, or about the individual treebanks in it, see the original, which is located at the LINDAT/CLARIN Centre for Language Research Infrastructure (http://hdl.handle.net/11234/LRT-1478). The individual treebanks have individual licenses, and the specific license and conditions of use for each treebank are given in the joint license "Universal Dependencies v1.1 License Agreement". What the licenses have in common is that they are open licenses (some Creative Commons, some GPL). Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008). This is the second release of the UD treebanks; a newer version, 1.3, is also available.
Julius Caesar’s account of the Gallic war, written 58–49 BC. Edition: T. Rice Holmes (1914): C. Iuli Commentarii Rerum in Gallia Gestarum VII A. Hirti Commentarius VII. Oxford: Oxford University Press. Electronic edition: Gregory Crane: De bello gallico. Perseus Digital Library. Tufts University, Medford MA.
A collection of letters from Marcus Tullius Cicero to Titus Pomponius Atticus, written 68–44 BC. Edition: L. C. Purser (1901): Epistulae ad Atticum. Oxford: Oxford University Press. Electronic edition: Gregory Crane: Letters to Atticus. Perseus Digital Library, Tufts University, Medford MA.
Fourth-century account of a pilgrimage to the Holy Land. Edition: Wilhelm Heraeus (1908): Silviae vel potius Aetheriae peregrinatio. Heidelberg: Carl Winter. Electronic edition: Itinerarium vel peregrinatio ad loca sancta. Bibliotheca Augustana.
The "NorGramBank – Newspaper text (years 2012, 2013) in Norwegian Bokmål from the Norwegian Newspaper Corpus" treebank is a syntactically annotated corpus based on data from the years 2012 and 2013 in the Norwegian Newspaper Corpus (NCC). This treebank is part of the INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 246397 sentences, 3157558 words and 1543 documents. Note that the available treebank contains only those newspaper articles from 2012 and 2013 that have been manually preprocessed; see details elsewhere in the metadata.
The "NorGramBank children’s fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of the INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 389564 sentences, 4111213 words and 155 documents. The source text was OCR-read by the National Library of Norway; INESS has semi-automatically corrected OCR errors (misinterpreted letters, etc.) in the source text before syntactic parsing.
The "NorGram Non-fiction text in Norwegian Bokmål from Forskning.no" treebank is a syntactically annotated corpus based on data taken from the Norwegian popular science website Forskning.no. This treebank is part of the INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 489341 sentences, 8321480 words and 13243 documents.
The "NorGramBank Newspaper text in Norwegian Bokmål from the LBK" treebank is a syntactically annotated corpus based on data taken from the reference corpus for Norwegian Bokmål, Leksikografisk Bokmålskorpus (LBK). This treebank is part of the INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 173914 sentences, 2661597 words and 599 documents.
The "NorGramBank non-fiction text in Norwegian Bokmål from the LBK" treebank is a syntactically annotated corpus based on data taken from the reference corpus for Norwegian Bokmål, Leksikografisk Bokmålskorpus (LBK). This treebank is part of the INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 173914 sentences, 2661597 words and 599 documents.
The "NorGramBank television subtitles in Norwegian Bokmål from LBK" treebank is a syntactically annotated corpus based on data taken from the reference corpus for Norwegian Bokmål, Leksikografisk Bokmålskorpus (LBK). This treebank is part of the INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 18043 sentences, 127844 words and 16 documents.
The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of the INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has semi-automatically corrected OCR errors (misinterpreted letters, etc.) in the source text before syntactic parsing.
The "NorGramBank fiction in older Norwegian Bokmål" treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of the INESS NorGramBank collection (see URL in metadata). As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words. The source text was OCR-read by the National Library of Norway; INESS has semi-automatically corrected OCR errors (misinterpreted letters, etc.) in the source text before syntactic parsing.
The treebank "NorGram NDT in LFG in Norwegian Bokmål (derivate from the Norwegian Dependency Treebank)" is based on the text material in the Norwegian Dependency Treebank (NDT), available from Språkbanken at the National Library of Norway. The sentences have been parsed and disambiguated in the Norwegian LFG treebank using the NorGram LFG grammar.
The "NorGram Newspaper text (30 documents from the years 2006 - 2009) in Norwegian Bokmål from the Norwegian Newspaper Corpus" treebank is a syntactically annotated corpus based on 30 documents from the years 2006 - 2009 in the Norwegian Newspaper Corpus (NCC). This treebank is part of the INESS NorGramBank collection (see URL in metadata). Note that the available treebank contains only those newspaper articles from 2012 and 2013 that have been manually preprocessed; see details elsewhere in the metadata.
The «Corona texts from NRK» treebank is a syntactically annotated corpus. It is based on data transcribed from the two newscasts Dagsrevyen and Supernytt produced by the Norwegian Broadcasting Corporation (NRK).
The ParGram collection is a collection of parallel treebanks covering a set of chosen syntactic constructions. It is a collaborative effort of the ParGram project, along with the ParSem project, by research groups in industrial and academic institutions around the world. The aim of ParGram is to produce wide-coverage grammars for a variety of languages. These are written collaboratively within the linguistic framework of LFG (Lexical Functional Grammar) and with a commonly agreed-upon set of grammatical features. The XLE (Xerox Linguistic Environment) is used as a development platform. ParSem develops semantic structures based on the ParGram syntactic structures. Most of the ParSem systems use the XLE’s XFR system. Semiannual meetings are held to bring together the various research groups involved in ParGram and ParSem.
The ParTMA collection is a collaborative effort by research groups in academic institutions around the world. The aim of ParTMA is to produce parallel treebanks that cover constructions relevant for the semantics of Tense, Mode and Aspect. The treebank sentences are analyzed with the grammars in the ParGram project.
The INESS Sofie Norwegian Treebank. The treebank is a syntactically annotated corpus based on the first chapters of the novel “Sofies verden” by Jostein Gaarder, published by Aschehoug forlag. The sentence analyses are produced by INESS for the META-NORD project, whose goal was to promote the accessibility of existing treebanks for the languages in the project. The corpus is automatically analyzed with the NorGram LFG grammar and all analyses are manually verified.
The Norwegian part of the META-NORD Sofie Parallel Treebank, a syntactically annotated parallel corpus based on the first chapters of the novel “Sofies verden” (Sophie's World) by Jostein Gaarder, published by Aschehoug forlag. The treebank consists of grammatical annotations of extracts from the original and was created by the INESS project for META-NORD. For more information, see the metadata description of the META-NORD Sofie Parallel Treebank.
The treebank "Norwegian Dependency Treebank in Norwegian Bokmål (copy @ INESS)" is a syntactically annotated corpus, created by the National Library of Norway. The copy in INESS allows for searches in this treebank using the INESS search system. The original is downloadable at Språkbanken.
The “Universal Dependencies 2.0” collection is searchable at the INESS portal. For further details about the original collection as a whole, or about individual treebanks in the collection, see the original, which is located at the LINDAT/CLARIN Centre for Language Research Infrastructure (http://hdl.handle.net/11234/1-1983). The individual treebanks have individual licenses, which are available through the joint license “Universal Dependencies v2.0 License Agreement”. Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
The “Universal Dependencies 2.1” collection is searchable at the INESS portal. For further details about the original collection as a whole, or about individual treebanks in the collection, see the original, which is located at the LINDAT/CLARIN Centre for Language Research Infrastructure (http://hdl.handle.net/11234/1-2515). The individual treebanks have individual licenses, which are available through the joint license “Universal Dependencies v2.1 License Agreement”. Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
The treebank "orv-afnik-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-avv-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-const-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-domo-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-drac-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-kiev-hyp-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-lav-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-luk-koloc-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-mst-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-novgorod-jaroslav-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-pskov-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-pskov-ivan-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-riga-goth-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-rig-smol1281-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-rusprav-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-sergrad-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-smol-pol-lit-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-usp-sbor-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-ust-vlad-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.
The treebank "orv-varlaam-dep" is part of the TOROT treebank collection. The TOROT Treebank is a dependency treebank with morphosyntactic and information-structure annotation. It includes texts in Old Church Slavonic, Old Russian and Middle Russian and is freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. The treebank is an expansion of the Slavic part of the PROIEL corpus and was started as part of the research project Birds and Beasts: Shaping Events in Old Russian, which was financed by the Norwegian Research Council. The treebank is still in active development.