2024-03-28T13:46:12Zhttps://clarino.uib.no/oaioai:clarino.uib.no:bul-treebank2018-03-05T11:30:38ZINESSGunn Inger Lyse Samdal2015-05-26hdl:11495/D935-7715-CE31-0clarin.eu:cr1:p_1407745711925Clarino UiBResourcehdl:11495/D93F-C6E9-65D9-2LandingPagehdl:11495/D93F-C6E9-65D9-2corpusThe Morphologically Annotated Part of BulTreeBankThis distribution represents only the morphological information encoded in BulTreeBank - HPSG-based Treebank of Bulgarian. It contains about 214000 tokens. It was used for the training of the TreeTagger for Bulgarian.
It contains sentences from Bulgarian Grammar Textbooks, Newspapers, Literature and other sources of texts.
Full documentation (Style Book, Tagset description) of the Treebank can be found on: http://www.bultreebank.org/TechRep.htmlBulTreeBank-Morphhttp://clarino.uib.no/korpuskel/landing-page?resource=bul-treebank&view=shorthttp://www.bultreebank.org/btbmorf/hdl:11495/D93F-C6E9-65D9-2bul-treebankPublicdownloadableaccessibleThroughInterfacehttp://www.bultreebank.org/btbmorf/https://hdl.handle.net/11495/D93F-C6E9-65D9-2META-SHARE (MS)META-SHARE NonCommercial NoRedistribution (MS-NC-NoReD)http://www.meta-net.eu/meta-share/meta-share-licenses/META-SHARE%20NonCommercial%20NoRedistribution-v%201.0.pdfBYIDLRTNCNOREDpersonSimovKirilmaleAssociate ProfessorBulgarian Academy of SciencesBulTreeBank Group, Linguistic Modelling Laboratory, IICTkivs@bultreebank.orghttp://www.bultreebank.org/btbmorf/
Acad. G.Bonchev 25A
1113SofiaBulgaria+359888473413personSimovKirilmaleAssociate ProfessorBulgarian Academy of SciencesBulTreeBank Group, Linguistic Modelling Laboratory, IICTkivs@bultreebank.orghttp://www.bultreebank.org/btbmorf/
Acad. G.Bonchev 25A
1113SofiaBulgaria+3598884734132015-05-26META-SHAREhttp://metashare.nb.no/repository/browse/the-morphologically-annotated-part-of-bultreebank/b3f0ba40395711e2b66e001708556d5a5db5c7f848dc4048b06b47f7835d6956/Englishen2018-03-05personLyseGunn IngerfemaleResearcher (Ph.D)University of BergenUniversitetet i BergenUiBUoBDepartment of Linguistic, Literary and Aesthetic Studiesiness@uib.noclarin@uib.nopersonSimovKirilmaleAssociate ProfessorBulgarian Academy of SciencesBulTreeBank Group, Linguistic Modelling Laboratory, IICTkivs@bultreebank.orghttp://www.bultreebank.org/btbmorf/
Acad. G.Bonchev 25A
1113SofiaBulgaria+359888473413documentationhttp://www.bultreebank.org/btbmorf/Written CorpustextIt contains sentences from Bulgarian Grammar Textbooks, Newspapers, Literature and other sources of texts. For a full text acknowledgement, see:
http://www.bultreebank.org/TextAcknowledgements.htmlmonolingualbgBulgarian214000tokensmorphosyntacticAnnotation-posTagginghttp://www.bultreebank.org/TechRep/BTB-TR03.pdfHPSGmixedThe morphological analyzer assigns all possible morphosyntactic analyses to tokens.The process of disambiguation is two-fold: first a set of 'certain' rules are applied, to ensure full precision. Then the rest of the corpus has been disambiguated manually. (Source: p.2, http://www.bultreebank.org/TechRep/BTB-TR03.pdf)annotationManualSeveral documents can be found at: http://www.bultreebank.org/TechRep.html.
Selected document: Kiril Simov, Petya Osenova and Milena Slavcheva. BTB-TR03: BulTreeBank Morphosyntactic Tagset. BulTreeBank Project Technical Report № 03. 2004