Treebank Selection

Select a set of treebanks to work with.
Languages: All · Abaza (0/1) · Abkhazian (0/1) · Afrikaans (0/5) · Akkadian (0/6) · Akuntsu (0/2) · Albanian (0/2) · Amharic (0/4) · Ancient Greek (to 1453) (2/21) · Ancient Hebrew (0/1) · Apurinã (0/2) · Arabic (1/19) · Armenian (0/5) · Assyrian Neo-Aramaic (0/3) · Bambara (0/4) · Basque (1/10) · Beja (0/2) · Belarusian (0/5) · Bengali (0/1) · Bhojpuri (0/3) · Borôro (0/1) · Breton (0/4) · Bulgarian (1/11) · Buriat (0/5) · Catalan (0/8) · Cebuano (0/1) · Chinese (0/29) · Chukot (0/2) · Church Slavic (1/12) · Classical Armenian (0/1) · Coptic (0/6) · Croatian (1/10) · Czech (1/36) · Danish (1/12) · Dutch (1/18) · Emerillon (0/1) · English (1/58) · Erzya (0/4) · Estonian (1/14) · Faroese (0/7) · Finnish (2/29) · French (1/40) · Galician (0/15) · Georgian (0/9) · German (1/34) · Gheg Albanian (0/1) · Gothic (1/10) · Guajajára (0/2) · Hebrew (1/11) · Hindi (1/14) · Hittite (0/1) · Hungarian (1/14) · Icelandic (0/10) · Indonesian (1/18) · Irish (1/13) · Italian (1/37) · Jamamadí (0/1) · Japanese (0/22) · Javanese (0/1) · K'iche' (0/2) · Kangri (0/2) · Karelian (0/3) · Karo (Ethiopia) (0/1) · Kazakh (0/8) · Khunsari (0/2) · Kirghiz (0/1) · Komi (0/8) · Komi-Permyak (0/3) · Korean (0/13) · Latin (3/35) · Latvian (0/9) · Ligurian (0/1) · Literary Chinese (0/1) · Lithuanian (0/8) · Livvi (0/3) · Low German (0/2) · Makuráp (0/2) · Malayalam (0/1) · Maltese (0/4) · Manx (0/2) · Marathi (0/5) · Mbyá Guaraní (0/6) · Modern Greek (1453-) (1/12) · Moksha (0/3) · Mundurukú (0/2) · Nayini (0/2) · Neapolitan (0/1) · Nhengatu (0/1) · Nigerian Pidgin (0/4) · Northern Kurdish (0/5) · Northern Sami (0/30) · Norwegian (0/5) · Norwegian Bokmål (1/62) · Norwegian Nynorsk (0/24) · Old English (ca. 450-1100) (0/5) · Old French (842-ca. 1400) (0/5) · Old Irish (to 900) (0/2) · Old Norse (0/7) · Old Russian (0/26) · Old Turkish (0/2) · Paraguayan Guaraní (0/1) · Persian (1/12) · Polish (1/40) · Pomak (0/1) · Portuguese (1/29) · Qafar (0/1) · Romanian (1/19) · Russian (0/28) · Sanskrit (0/8) · Saya (0/1) · Scottish Gaelic (0/3) · Serbian (0/5) · Sinhala (0/1) · Skolt Sami (0/3) · Slovak (0/7) · Slovenian (1/18) · Sonha (0/2) · South Levantine Arabic (0/2) · Spanish (1/23) · Swedish (1/25) · Swedish Sign Language (0/6) · Swiss German (0/3) · Tagalog (0/6) · Tamil (1/12) · Tatar (0/1) · Telugu (0/5) · Thai (0/4) · Tupinambá (0/2) · Turkish (0/30) · Uighur (0/7) · Ukrainian (0/7) · Umbrian (0/1) · Upper Sorbian (0/5) · Urdu (0/8) · Urubú-Kaapor (0/2) · Vietnamese (0/7) · Warlpiri (0/4) · Welsh (0/3) · Western Armenian (0/2) · Western Frisian (0/2) · Wolof (0/6) · Xavánte (0/1) · Xibe (0/1) · Yakut (0/1) · Yoruba (0/4) · Yue Chinese (0/5) · Yupik (0/2) · Zacatlán-Ahuacatlán-Tepetzintla Nahuatl (0/1)
Treebank Collections: All · Acquis (7) · Alpino (1) · BulTreeBank (1) · CLARIN-PL (5) · DELPH-IN (2) · GEGO (4) · GNC (2) · GeoGram (4) · HunGram (4) · ISWOC (9) · JOS (1) · Menotec (7) · Mercurius (1) · NAOB (15) · NDT (6) · NorGram (58) · NorGramBank (40) · POLFIE (23) · PROIEL (10) · PaHC (2) · ParGram (11) · ParTMA (15) · Sami-open (15) · Sami-restricted (7) · Sofie (9) · TIGER (3) · TOROT (22) · Universal Dependencies 1.1 (19) · Universal Dependencies 1.2 (36) · Universal Dependencies 1.3 (53) · Universal Dependencies 1.4 (63) · Universal Dependencies 2.0 (63) · Universal Dependencies 2.1 (103) · Universal Dependencies 2.12 (245) · Universal Dependencies 2.3 (130) · Universal Dependencies 2.5 (157) · Universal Dependencies 2.8 (200) · WolGram (3) · XPar (2)
Treebank Types: All · lfg (0/125) · constituency (0/19) · constituency-alpino (0/1) · dependency (0/48) · dependency-cg (36/1110) · dependency-tuebadz (0/1) · hpsg (0/2)
Show only Parallel Treebanks
Click on a treebank name below to proceed. All selected treebanks will be available for viewing and searching.
Name Collection Type Sentences Words Indexed License Downloads
Total (all selected treebanks) 420 517 6 424 302
Ancient Greek (to 1453) (grc) 32 854   427 286
grc-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 16 221 220 323 no unspecified no
grc-ud-proiel-1.2-dep Universal Dependencies 1.2 dependency-cg 16 633 206 963 no unspecified no
Arabic (ara) 7 664   265 943
ara-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 7 664 265 943 no unspecified no
Basque (eus) 8 993   102 105
eus-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 8 993 102 105 no unspecified no
Bulgarian (bul) 11 138   135 149
bul-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 11 138 135 149 no unspecified no
Church Slavic (chu) 6 346   57 507
chu-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 6 346 57 507 no unspecified no
Croatian (hrv) 3 957   76 914
hrv-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 3 957 76 914 no unspecified no
Czech (ces) 87 913   1 295 498
ces-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 87 913 1 295 498 no unspecified no
Danish (dan) 5 512   87 563
dan-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 5 512 87 563 no unspecified no
Dutch (nld) 13 735   187 567
nld-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 13 735 187 567 no unspecified no
English (eng) 16 622   227 973
eng-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 16 622 227 973 no unspecified no
Estonian (est) 1 315   8 033
est-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 1 315 8 033 no unspecified no
Finnish (fin) 32 373   293 287
fin-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 13 581 155 565 no unspecified no
fin-ud-ftb-1.2-dep Universal Dependencies 1.2 dependency-cg 18 792 137 722 no unspecified no
French (fra) 16 446   345 854
fra-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 16 446 345 854 no unspecified no
German (deu) 15 894   258 577
deu-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 15 894 258 577 no unspecified no
Gothic (got) 5 450   56 134
got-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 5 450 56 134 no unspecified no
Hebrew (heb) 6 216   98 830
heb-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 6 216 98 830 no unspecified no
Hindi (hin) 16 647   346 259
hin-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 16 647 346 259 no unspecified no
Hungarian (hun) 1 299   22 898
hun-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 1 299 22 898 no unspecified no
Indonesian (ind) 5 593   105 469
ind-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 5 593 105 469 no unspecified no
Irish (gle) 1 020   21 780
gle-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 1 020 21 780 no unspecified no
Italian (ita) 12 677   222 775
ita-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 12 677 222 775 no unspecified no
Latin (lat) 33 546   432 082
lat-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 3 269 42 913 no unspecified no
lat-ud-itt-1.2-dep Universal Dependencies 1.2 dependency-cg 15 295 223 966 no unspecified no
lat-ud-proiel-1.2-dep Universal Dependencies 1.2 dependency-cg 14 982 165 203 no unspecified no
Modern Greek (1453-) (ell) 2 411   53 496
ell-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 2 411 53 496 no unspecified no
Norwegian Bokmål (nob) 20 045   276 789
nob-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 20 045 276 789 no unspecified no
Persian (fas) 5 997   143 992
fas-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 5 997 143 992 no unspecified no
Polish (pol) 8 227   70 986
pol-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 8 227 70 986 no unspecified no
Portuguese (por) 9 359   197 041
por-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 9 359 197 041 no unspecified no
Romanian (ron) 633   10 564
ron-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 633 10 564 no unspecified no
Slovenian (slv) 7 996   122 436
slv-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 7 996 122 436 no unspecified no
Spanish (spa) 16 013   377 286
spa-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 16 013 377 286 no unspecified no
Swedish (swe) 6 026   87 626
swe-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 6 026 87 626 no unspecified no
Tamil (tam) 600   8 603
tam-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 600 8 603 no unspecified no
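
Each language row in the listing above carries a subtotal of the sentence and word counts of its treebanks. As a minimal sketch of how those subtotals can be cross-checked, the snippet below re-adds the figures for the three languages that have more than one selected treebank. The numbers are copied by hand from the listing; the names `TREEBANKS`, `SUBTOTALS`, `subtotal`, and `check_subtotals` are hypothetical helpers introduced for illustration and are not part of INESS.

```python
# Per-treebank (sentences, words) figures, copied by hand from the listing.
# The dictionary layout and all names here are illustrative assumptions.
TREEBANKS = {
    "grc": {
        "grc-ud-1.2-dep": (16_221, 220_323),
        "grc-ud-proiel-1.2-dep": (16_633, 206_963),
    },
    "fin": {
        "fin-ud-1.2-dep": (13_581, 155_565),
        "fin-ud-ftb-1.2-dep": (18_792, 137_722),
    },
    "lat": {
        "lat-ud-1.2-dep": (3_269, 42_913),
        "lat-ud-itt-1.2-dep": (15_295, 223_966),
        "lat-ud-proiel-1.2-dep": (14_982, 165_203),
    },
}

# Language-level subtotal rows (sentences, words) as shown in the listing.
SUBTOTALS = {
    "grc": (32_854, 427_286),
    "fin": (32_373, 293_287),
    "lat": (33_546, 432_082),
}


def subtotal(treebanks):
    """Sum the sentence and word counts over one language's treebanks."""
    sentences = sum(s for s, _ in treebanks.values())
    words = sum(w for _, w in treebanks.values())
    return sentences, words


def check_subtotals(treebanks_by_language, subtotals):
    """Map each language code to True if its recomputed subtotal matches."""
    return {
        lang: subtotal(treebanks) == subtotals[lang]
        for lang, treebanks in treebanks_by_language.items()
    }


if __name__ == "__main__":
    for lang, ok in check_subtotals(TREEBANKS, SUBTOTALS).items():
        print(lang, "matches" if ok else "MISMATCH")
```

The same check extends to the overall total row by adding the remaining single-treebank languages to `TREEBANKS` and comparing the grand sum against the figures in that row.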