INESS-logo
Metadata

Treebanks

Tools


META-NORD Sofie Parallel Treebank
Full metadata record:
Persistent identifier for the resource:
Contact Person: Rosén, Victoria
This resource is licensed under the following terms:
unspecified
BY LRT NORED
BY LRT NORED
Please click on the link to read the license terms.
By accepting the terms of the license you will be granted access to the resource.
Attribution:
The "Sofie analyses" is research material based on the novel "Sofies verden" [Sophie's world] by Jostein Gaarder, published by Aschehoug Forlag. If you use INESS in your research, please link to the INESS webpage (http://clarino.uib.no/iness) in materials included with your data. We suggest the following reference in your scientific publications: Victoria Rosén, Koenraad De Smedt, Paul Meurer, and Helge Dyvik. An open infrastructure for advanced treebanking. In Jan Hajič, Koenraad De Smedt, Marko Tadić, and António Branco (eds.) META-RESEARCH Workshop on Advanced Treebanking at LREC2012, pages 22–29, Istanbul, Turkey, May 2012.
Language(s): Norwegian (no), Swedish (sv), Danish (da), Estonian (et), Georgian (ka), German (de), Icelandic (is), English (en)
Description:
The Sofie Parallel Treebank is a syntactically annotated parallel corpus based on the first chapters of the novel “Sofies verden” by Jostein Gaarder, published by Aschehoug forlag. The treebank is a product of the META-NORD project and its goal to promote the accessability of existing treebanks for the languages in the project.

SOURCE TEXT

The Norwegian novel Sofies verden (Gaarder 1991) was chosen as a suitable basis for treebanking because it is linguistically rich and professionally translated in many languages, and because some treebanks already existed for text selections from this material in some languages in the META-NORD area.

Previous work was done by the Nordic Treebank Network, funded by the Nordic Language Technology Program (2001-2005) but had not been maintained and was no longer accessible. It was decided to gather those treebanks, document them, supplement them with additional treebanks for some languages where this effort was feasible, and make the resulting resources accessible. The resulting work has been a joint effort between META-NORD and the INESS project, which hosts the treebank.

The rights for the Finnish treebank have not been cleared, and this treebank is currently unavailable.

More information about the treebank development in META-NORD is available in the META-NORD Deliverable 3.4 on Parallel Treebanks (http://www.meta-nord.eu).