INESS-logo
Metadata

Treebanks

Tools


Resource common info
Resource type: corpus
Identification info
Resource name: META-NORD Sofie English Treebank
Description: The English part of the META-NORD Sofie Parallel Treebank. This treebank is a syntactically annotated parallel corpus based on the first chapters of the novel “Sofies verden” (Sophie's World) by Jostein Gaarder, published by Aschehoug forlag. The treebank consists of grammatical annotations of extracts from the English translation of the novel, originally created as part of the Stockholm MULtilingual parallel TReebank (SMULTRON) and now included in the META-NORD Sofie Parallel Treebank. The novel was translated into English by Paulette Moller and the English translation is published by Phoenix House/The Orion Publishing Group (1995).
Resource short name: English Sofie
Url: http://clarino.uib.no/iness/landing-page?resource=eng-sofie-con, http://clarino.uib.no/iness/landing-page?resource=eng-sofie-con&view=short, http://clarino.uib.no/iness/landing-page?resource=sofie-par&view=short
PID: hdl:11495/DA13-B50A-F164-1
Identifier: eng-sofie-con
Distribution info
Licence info
User category: Public
Distribution access medium: downloadable, accessibleThroughInterface
Download location: http://clarino.uib.no/iness
Execution location: http://clarino.uib.no/iness
Attribution text: The "Sofie analyses" is research material based on the novel "Sofies verden" [Sophie's world] by Jostein Gaarder, published by Aschehoug Forlag. The English translation is published by Phoenix House/The Orion Publishing Group. The linguistic annotations were created as part of the Stockholm MULtilingual parallel TReebank (SMULTRON). Reference: Martin Volk, Anne Göhring, Annette Rios, Torsten Marek and Yvonne Samuelsson. SMULTRON (version 4.0) — The Stockholm MULtilingual parallel TReebank. 2015. URL: http://www.cl.uzh.ch/research/parallelcorpora/paralleltreebanks_en.html. Institute of Computational Linguistics, University of Zurich. The treebank is now included in the META-NORD Sofie Parallel Treebank. If you use INESS in your research, please link to the INESS webpage (http://clarino.uib.no/iness) in materials included with your data. In your scientific publications, please use the following reference: Victoria Rosén, Koenraad De Smedt, Paul Meurer, and Helge Dyvik. An open infrastructure for advanced treebanking. In Jan Hajič, Koenraad De Smedt, Marko Tadić, and António Branco (eds.) META-RESEARCH Workshop on Advanced Treebanking at LREC2012, pages 22–29, Istanbul, Turkey, May 2012.
Licence
unspecified
By accepting the terms of the license you will be granted access to the resource.
Licence URL: http://clarino.uib.no/comedi/licenses/sofie-parallel-license.txt
Conditions of use: BY, LRT, NORED
Ipr holder
Actor info
Actor type: person
Person info
Gaarder , Jostein
Communication info
Email: even.rakil@aschehougagency.no
Actor info
Actor type: organization
Organization info
Phoenix House/The Orion Publishing Group
Communication info
Email: rights.enquiries@orionbooks.co.uk
Url: https://www.orionbooks.co.uk
Country: English
Actor info
Actor type: organization
Organization info
Stockholm University
Contact
Actor info
Actor type: person
Person info
Rosén , Victoria
Position: Associate Professor
Affiliation
Organization info
University of Bergen , UoB
Department of Linguistic, Literary and Aesthetic Studies
Communication info
Email: iness@uib.no
Metadata info
Metadata creation date: 2015-06-19
Source: This metadata is based on the metadata originally created in META-SHARE in 2012. The present metadata should be considered as authoritative.
Original metadata schema: META-SHARE
Original metadata link: http://metashare.nb.no/repository/browse/meta-nord-sofie-english-treebank/88ea653a551d11e28914001708556d5a9c23a355d96b4f949633de2cf96473b0/
Metadata language name: English
Metadata language id: en
Metadata last date updated: 2015-12-10
Metadata creator
Actor info
Actor type: person
Person info
Lyse , Gunn Inger
Position: Researcher (Ph.D)
Affiliation
Organization info
University of Bergen, , UoB
Department of Linguistic, Literary and Aesthetic Studies
Communication info
Email: iness@uib.no, clarin@uib.no
Validation info
Validated: true
Validation type: content
Validation mode: mixed
Validation mode details: The parser in Annotate gives correct suggestions in about 70% of the cases, and the annotator always has the possibility to accept, get a new suggestion or create a new node manually. The DECCA tool has been used for completeness and consistency checking.
Validation report unstructured
Role: validationReport
Document unstructured: Detection of Errors and Correction in Corpus Annotation, http://decca.osu.edu/
Resource creation info
Funding project
Project info
Project name: Stockholm MULtilingual TReebank
Project short name: SMULTRON
Url: http://www.ling.su.se/english/nlp/corpora-and-resources/smultron/stockholm-multilingual-treebank-smultron-1.14047
Funding type: other
Funding project
Project info
Project name: META-NORD
Project ID: The META-NORD project has received funding from the European Commission through the CIP ICT PSP Prog
Url: http://meta-nord.eu
Funding type: euFunds
Funder: European Commission through the CIP ICT PSP Programme
Project start date: 2011-02-01
Project end date: 2013-01-31