INESS-logo
Treebank Selection

Treebanks

Tools


Select a set of treebanks to work with. ?
Languages: All · Abaza (3) · Abkhazian (3) · Afrikaans (7) · Akkadian (10) · Akuntsu (4) · Albanian (3) · Amharic (6) · Ancient Greek (to 1453) (27) · Ancient Hebrew (3) · Apurinã (4) · Arabic (25) · Armenian (9) · Assyrian Neo-Aramaic (5) · Azerbaijani (2) · Bambara (6) · Basque (12) · Bavarian (2) · Beja (3) · Belarusian (7) · Bengali (3) · Bhojpuri (5) · Borôro (3) · Breton (6) · Bulgarian (13) · Buriat (7) · Catalan (10) · Cebuano (3) · Chinese (43) · Chukot (4) · Church Slavic (14) · Classical Armenian (3) · Coptic (8) · Croatian (12) · Czech (48) · Danish (14) · Dutch (22) · Egyptian (Ancient) (2) · Emerillon (3) · English (80) · Erzya (6) · Estonian (18) · Faroese (11) · Finnish (37) · French (54) · Galician (21) · Gbaya (Central African Republic) (1) · Georgian (13) · German (42) · Gheg Albanian (3) · Gothic (12) · Guajajára (4) · Gujarati (2) · Gweno (1) · Haitian (2) · Hausa (4) · Hebrew (16) · Hindi (18) · Hittite (3) · Hungarian (16) · Icelandic (18) · Indonesian (24) · Irish (19) · Italian (57) · Jamamadí (3) · Japanese (34) · Javanese (3) · K'iche' (4) · Kangri (4) · Karelian (5) · Karo (Ethiopia) (3) · Kazakh (10) · Khunsari (4) · Kirghiz (5) · Komi (12) · Komi-Permyak (5) · Korean (20) · Latgalian (2) · Latin (47) · Latvian (13) · Ligurian (3) · Literary Chinese (5) · Lithuanian (12) · Livvi (5) · Low German (4) · Luxembourgish (2) · Macedonian (2) · Makuráp (4) · Malayalam (3) · Maltese (6) · Manx (4) · Marathi (7) · Mbyá Guaraní (10) · Middle French (ca. 1400-1600) (2) · Modern Greek (1453-) (19) · Moksha (5) · Mundurukú (4) · (1) · (2) · (1) · (1) · (1) · (1) · Nayini (4) · Neapolitan (3) · Nhengatu (3) · Nigerian Pidgin (6) · Northern Kurdish (7) · Northern Sami (32) · Norwegian (5) · Norwegian Bokmål (66) · Norwegian Nynorsk (28) · Old English (ca. 450-1100) (5) · Old French (842-ca. 1400) (7) · Old Irish (to 900) (6) · Old Norse (7) · Old Russian (34) · Old Turkish (4) · Paraguayan Guaraní (3) · Paumarí (2) · Pech (1) · Persian (16) · Phrygian (1) · Polish (46) · Pomak (3) · Portuguese (42) · Pushto (1) · Qafar (3) · Romanian (29) · Russian (38) · Sanskrit (12) · Saya (3) · Scottish Gaelic (5) · Serbian (7) · Sinhala (3) · Skolt Sami (5) · Slovak (9) · Slovenian (22) · Sonha (4) · South Levantine Arabic (4) · Spanish (31) · Spanish Sign Language (1) · Swedish (31) · Swedish Sign Language (8) · Swiss German (5) · Tagalog (10) · Tamil (16) · Tatar (3) · Telugu (9) · Thai (6) · Tswana (2) · Tupinambá (4) · Turkish (54) · Uighur (9) · Ukrainian (8) · Umbrian (3) · Upper Sorbian (7) · Urdu (10) · Urubú-Kaapor (4) · Uzbek (1) · Veps (2) · Vietnamese (11) · Warlpiri (6) · Welsh (5) · Western Armenian (4) · Western Frisian (4) · Wolof (8) · Xavánte (3) · Xibe (3) · Yakut (3) · Yoruba (6) · Yue Chinese (7) · Yupik (4) · Zacatlán-Ahuacatlán-Tepetzintla Nahuatl (3)
Treebank Collections: All · Acquis (0/7) · Alpino (0/1) · BulTreeBank (0/1) · CLARIN-PL (0/5) · DELPH-IN (0/2) · GEGO (0/4) · GNC (0/2) · GeoGram (0/4) · HunGram (0/4) · ISWOC (0/9) · JOS (0/1) · Menotec (0/7) · Mercurius (0/1) · NAOB (0/15) · NDT (0/6) · NorGram (0/58) · NorGramBank (0/40) · Ordbøkene (0/8) · POLFIE (0/23) · PROIEL (1/10) · PaHC (0/2) · ParGram (0/11) · ParTMA (0/15) · Sami-open (0/15) · Sami-restricted (0/7) · Sofie (0/9) · TIGER (0/3) · TOROT (20/22) · Universal Dependencies 1.1 (0/19) · Universal Dependencies 1.2 (2/36) · Universal Dependencies 1.3 (2/53) · Universal Dependencies 1.4 (2/63) · Universal Dependencies 2.0 (2/63) · Universal Dependencies 2.1 (3/103) · Universal Dependencies 2.12 (9/245) · Universal Dependencies 2.14 (9/283) · Universal Dependencies 2.15 (9/297) · Universal Dependencies 2.3 (3/130) · Universal Dependencies 2.5 (3/157) · Universal Dependencies 2.8 (5/200) · WolGram (0/3) · XPar (0/2)
Treebank Types: All · lfg (0/129) · constituency (0/19) · constituency-alpino (0/1) · dependency (21/48) · dependency-cg (49/1691) · dependency-tuebadz (0/1) · hpsg (0/2)
Show only Parallel Treebanks

Show custom treebank:
Click on a treebank name below to proceed. All selected treebanks will be available for viewing and searching. | Show treebank descriptions
Selected Name Collection Type Sentences Words Indexed License Downloads
all | none 178 930 3 062 804
Gothic (got) 42 381   435 422
got-gothic-nt-dep PROIEL dependency 5 457 55 878 yes CC-BY-NC-SA no
got-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 5 450 56 134 no unspecified no
got-ud-1.3-dep Universal Dependencies 1.3 dependency-cg 5 450 56 134 no unspecified no
got-ud-1.4-dep Universal Dependencies 1.4 dependency-cg 5 450 56 134 no unspecified no
got-ud-2.0-dep Universal Dependencies 2.0 dependency-cg 4 372 45 142 no unspecified no
got-ud-2.1-dep Universal Dependencies 2.1 dependency-cg 5 400 55 324 no unspecified no
got-ud-2.3-dep Universal Dependencies 2.3 dependency-cg 5 401 55 340 no unspecified no
got-ud-2.5-dep Universal Dependencies 2.5 dependency-cg 5 401 55 336 no (Accepted) no
Hindi (hin) 117 845   2 456 809
hin-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 16 647 346 259 no unspecified no
hin-ud-1.3-dep Universal Dependencies 1.3 dependency-cg 16 647 346 259 no unspecified no
hin-ud-1.4-dep Universal Dependencies 1.4 dependency-cg 16 647 346 259 no unspecified no
hin-ud-2.0-dep Universal Dependencies 2.0 dependency-cg 14 963 311 284 no unspecified no
hin-ud-2.1-dep Universal Dependencies 2.1 dependency-cg 16 647 346 259 no unspecified no
hin-ud-hdtb-2.3-dep Universal Dependencies 2.3 dependency-cg 16 647 346 259 no unspecified no
hin-ud-hdtb-2.5-dep Universal Dependencies 2.5 dependency-cg 16 647 346 259 no (Accepted) no
hin-ud-pud-2.1-dep Universal Dependencies 2.1 dependency-cg 1 000 22 657 no unspecified no
hin-ud-pud-2.3-dep Universal Dependencies 2.3 dependency-cg 1 000 22 657 no unspecified no
hin-ud-pud-2.5-dep Universal Dependencies 2.5 dependency-cg 1 000 22 657 no (Accepted) no
Hittite (hit) 0   0
Old Russian (orv) 18 704   170 573
orv-afnik-dep TOROT dependency 889 6 471 yes CC-BY-NC-SA no
orv-avv-dep TOROT dependency 3 238 22 180 yes CC-BY-NC-SA no
orv-const-dep TOROT dependency 755 8 920 yes CC-BY-NC-SA no
orv-domo-dep TOROT dependency 1 902 22 262 yes CC-BY-NC-SA no
orv-drac-dep TOROT dependency 288 2 438 yes CC-BY-NC-SA no
orv-kiev-hyp-dep TOROT dependency 57 530 yes CC-BY-NC-SA no
orv-lav-dep TOROT dependency 7 128 52 316 yes CC-BY-NC-SA no
orv-luk-koloc-dep TOROT dependency 91 872 yes CC-BY-NC-SA no
orv-mst-dep TOROT dependency 8 157 yes CC-BY-NC-SA no
orv-novgorod-jaroslav-dep TOROT dependency 30 410 yes CC-BY-NC-SA no
orv-pskov-dep TOROT dependency 201 2 301 yes CC-BY-NC-SA no
orv-pskov-ivan-dep TOROT dependency 26 331 yes CC-BY-NC-SA no
orv-riga-goth-dep TOROT dependency 111 1 499 yes CC-BY-NC-SA no
orv-rig-smol1281-dep TOROT dependency 13 167 yes CC-BY-NC-SA no
orv-rusprav-dep TOROT dependency 421 3 930 yes CC-BY-NC-SA no
orv-sergrad-dep TOROT dependency 1 441 19 905 yes CC-BY-NC-SA no
orv-smol-pol-lit-dep TOROT dependency 23 335 yes CC-BY-NC-SA no
orv-usp-sbor-dep TOROT dependency 2 043 24 927 yes CC-BY-NC-SA no
orv-ust-vlad-dep TOROT dependency 30 481 yes CC-BY-NC-SA no
orv-varlaam-dep TOROT dependency 9 141 yes CC-BY-NC-SA no
Paraguayan Guaraní (gug) 0   0