INESS-logo
Treebank Selection

Treebanks

Tools


Select a set of treebanks to work with. ?
Languages: All · Abaza (0/3) · Abkhazian (0/3) · Afrikaans (0/7) · Akkadian (0/10) · Akuntsu (0/4) · Albanian (0/3) · Amharic (0/6) · Ancient Greek (to 1453) (3/27) · Ancient Hebrew (0/3) · Apurinã (0/4) · Arabic (0/25) · Armenian (0/9) · Assyrian Neo-Aramaic (0/5) · Azerbaijani (0/2) · Bambara (0/6) · Basque (0/12) · Bavarian (0/2) · Beja (0/3) · Belarusian (0/7) · Bengali (0/3) · Bhojpuri (0/5) · Borôro (0/3) · Breton (0/6) · Bulgarian (0/13) · Buriat (0/7) · Catalan (0/10) · Cebuano (0/3) · Chinese (0/43) · Chukot (0/4) · Church Slavic (3/14) · Classical Armenian (1/3) · Coptic (0/8) · Croatian (0/12) · Czech (0/48) · Danish (0/14) · Dutch (1/22) · Egyptian (Ancient) (0/2) · Emerillon (0/3) · English (7/80) · Erzya (0/6) · Estonian (0/18) · Faroese (0/11) · Finnish (0/37) · French (0/54) · Galician (0/21) · Gbaya (Central African Republic) (0/1) · Georgian (5/13) · German (5/42) · Gheg Albanian (0/3) · Gothic (1/12) · Guajajára (0/4) · Gujarati (0/2) · Gweno (0/1) · Haitian (0/2) · Hausa (0/4) · Hebrew (0/16) · Hindi (0/18) · Hittite (0/3) · Hungarian (2/16) · Icelandic (0/18) · Indonesian (2/24) · Irish (0/19) · Italian (1/57) · Jamamadí (0/3) · Japanese (0/34) · Javanese (0/3) · K'iche' (0/4) · Kangri (0/4) · Karelian (0/5) · Karo (Ethiopia) (0/3) · Kazakh (0/10) · Khunsari (0/4) · Kirghiz (0/5) · Komi (0/12) · Komi-Permyak (0/5) · Korean (0/20) · Latgalian (0/2) · Latin (4/47) · Latvian (0/13) · Ligurian (0/3) · Literary Chinese (0/5) · Lithuanian (0/12) · Livvi (0/5) · Low German (0/4) · Luxembourgish (0/2) · Macedonian (0/2) · Makuráp (0/4) · Malayalam (0/3) · Maltese (0/6) · Manx (0/4) · Marathi (0/7) · Mbyá Guaraní (0/10) · Middle French (ca. 1400-1600) (0/2) · Modern Greek (1453-) (1/19) · Moksha (0/5) · Mundurukú (0/4) · (0/1) · (0/2) · (0/1) · (0/1) · (0/1) · (0/1) · Nayini (0/4) · Neapolitan (0/3) · Nhengatu (0/3) · Nigerian Pidgin (0/6) · Northern Kurdish (0/7) · Northern Sami (0/32) · Norwegian (5) · Norwegian Bokmål (38/66) · Norwegian Nynorsk (7/28) · Old English (ca. 450-1100) (5) · Old French (842-ca. 1400) (1/7) · Old Irish (to 900) (0/6) · Old Norse (7) · Old Russian (20/34) · Old Turkish (0/4) · Paraguayan Guaraní (0/3) · Paumarí (0/2) · Pech (0/1) · Persian (0/16) · Phrygian (0/1) · Polish (23/46) · Pomak (0/3) · Portuguese (4/42) · Pushto (0/1) · Qafar (0/3) · Romanian (0/29) · Russian (1/38) · Sanskrit (0/12) · Saya (0/3) · Scottish Gaelic (0/5) · Serbian (0/7) · Sinhala (0/3) · Skolt Sami (0/5) · Slovak (0/9) · Slovenian (0/22) · Sonha (0/4) · South Levantine Arabic (0/4) · Spanish (0/31) · Spanish Sign Language (0/1) · Swedish (0/31) · Swedish Sign Language (0/8) · Swiss German (0/5) · Tagalog (0/10) · Tamil (0/16) · Tatar (0/3) · Telugu (0/9) · Thai (0/6) · Tswana (0/2) · Tupinambá (0/4) · Turkish (1/54) · Uighur (0/9) · Ukrainian (0/8) · Umbrian (0/3) · Upper Sorbian (0/7) · Urdu (2/10) · Urubú-Kaapor (0/4) · Uzbek (0/1) · Veps (0/2) · Vietnamese (0/11) · Warlpiri (0/6) · Welsh (0/5) · Western Armenian (0/4) · Western Frisian (0/4) · Wolof (3/8) · Xavánte (0/3) · Xibe (0/3) · Yakut (0/3) · Yoruba (0/6) · Yue Chinese (0/7) · Yupik (0/4) · Zacatlán-Ahuacatlán-Tepetzintla Nahuatl (0/3)
Treebank Collections: All · Acquis (1/7) · Alpino (1) · BulTreeBank (0/1) · CLARIN-PL (5) · DELPH-IN (2) · GEGO (0/4) · GNC (0/2) · GeoGram (4) · HunGram (4) · ISWOC (9) · JOS (0/1) · Menotec (7) · Mercurius (0/1) · NAOB (15) · NDT (4/6) · NorGram (58) · NorGramBank (40) · Ordbøkene (8) · POLFIE (23) · PROIEL (10) · PaHC (0/2) · ParGram (11) · ParTMA (15) · Sami-open (0/15) · Sami-restricted (0/7) · Sofie (2/9) · TIGER (2/3) · TOROT (22) · Universal Dependencies 1.1 (0/19) · Universal Dependencies 1.2 (0/36) · Universal Dependencies 1.3 (0/53) · Universal Dependencies 1.4 (0/63) · Universal Dependencies 2.0 (0/63) · Universal Dependencies 2.1 (0/103) · Universal Dependencies 2.12 (0/245) · Universal Dependencies 2.14 (0/283) · Universal Dependencies 2.15 (0/297) · Universal Dependencies 2.3 (0/130) · Universal Dependencies 2.5 (0/157) · Universal Dependencies 2.8 (0/200) · WolGram (3) · XPar (2)
Treebank Types: All · lfg (102/129) · constituency (17/19) · constituency-alpino (1) · dependency (48) · dependency-cg (828/1691) · dependency-tuebadz (0/1) · hpsg (2)
Show only Parallel Treebanks

Show custom treebank:
Click on a treebank name below to proceed. All selected treebanks will be available for viewing and searching. | Show treebank descriptions
Selected Name Collection Type Sentences Words Indexed License Downloads
all | none 98 337 1 367 535
Ancient Greek (to 1453) (grc) 17 683   232 538
grc-chron-dep PROIEL dependency 976 23 421 yes CC-BY-NC-SA no
grc-greek-nt-dep PROIEL dependency 11 261 128 724 yes CC-BY-NC-SA no
grc-hdt-dep PROIEL dependency 5 446 80 393 yes CC-BY-NC-SA no
Church Slavic (chu) 13 538   125 805
chu-marianus-dep PROIEL dependency 6 350 58 544 yes CC-BY-NC-SA no
chu-supr-dep TOROT dependency 7 054 66 078 yes CC-BY-NC-SA no
chu-zogr-dep TOROT dependency 134 1 183 yes CC-BY-NC-SA no
Classical Armenian (xcl) 1 916   19 105
xcl-armenian-nt-dep PROIEL dependency 1 916 19 105 yes CC-BY-NC-SA no
Dutch (nld) 65 200   990 087
nld-lassy-con Alpino constituency-alpino 65 200 990 087 yes unspecified no