INESS-logo
Treebank Selection

Treebanks

Tools


Select a set of treebanks to work with. ?
Languages: All · Abaza (0/2) · Abkhazian (0/2) · Afrikaans (0/6) · Akkadian (0/8) · Akuntsu (0/3) · Albanian (0/3) · Amharic (0/5) · Ancient Greek (to 1453) (0/24) · Ancient Hebrew (0/2) · Apurinã (0/3) · Arabic (0/22) · Armenian (0/7) · Assyrian Neo-Aramaic (0/4) · Azerbaijani (0/1) · Bambara (0/5) · Basque (0/11) · Bavarian (0/1) · Beja (0/3) · Belarusian (0/6) · Bengali (0/2) · Bhojpuri (0/4) · Borôro (0/2) · Breton (0/5) · Bulgarian (0/12) · Buriat (0/6) · Catalan (0/9) · Cebuano (0/2) · Chinese (0/36) · Chukot (0/3) · Church Slavic (0/13) · Classical Armenian (0/2) · Coptic (0/7) · Croatian (0/11) · Czech (0/42) · Danish (0/13) · Dutch (0/20) · Egyptian (Ancient) (0/1) · Emerillon (0/2) · English (0/69) · Erzya (0/5) · Estonian (0/16) · Faroese (0/9) · Finnish (0/33) · French (0/47) · Galician (0/18) · Georgian (2/10) · German (2/38) · Gheg Albanian (0/2) · Gothic (0/11) · Guajajára (0/3) · Gujarati (0/1) · Haitian (0/1) · Hausa (0/2) · Hebrew (0/13) · Hindi (0/16) · Hittite (0/2) · Hungarian (0/15) · Icelandic (0/14) · Indonesian (0/21) · Irish (0/16) · Italian (0/47) · Jamamadí (0/2) · Japanese (0/28) · Javanese (0/2) · K'iche' (0/3) · Kangri (0/3) · Karelian (0/4) · Karo (Ethiopia) (0/2) · Kazakh (0/9) · Khunsari (0/3) · Kirghiz (0/3) · Komi (0/10) · Komi-Permyak (0/4) · Korean (0/16) · Latgalian (0/1) · Latin (0/41) · Latvian (0/11) · Ligurian (0/2) · Literary Chinese (0/3) · Lithuanian (0/10) · Livvi (0/4) · Low German (0/3) · Luxembourgish (0/1) · Macedonian (0/1) · Makuráp (0/3) · Malayalam (0/2) · Maltese (0/5) · Manx (0/3) · Marathi (0/6) · Mbyá Guaraní (0/8) · Middle French (ca. 1400-1600) (0/1) · Modern Greek (1453-) (0/15) · Moksha (0/4) · Mundurukú (0/3) · (0/1) · Nayini (0/3) · Neapolitan (0/2) · Nhengatu (0/2) · Nigerian Pidgin (0/5) · Northern Kurdish (0/6) · Northern Sami (0/31) · Norwegian (0/5) · Norwegian Bokmål (0/63) · Norwegian Nynorsk (0/25) · Old English (ca. 450-1100) (0/5) · Old French (842-ca. 1400) (0/6) · Old Irish (to 900) (0/4) · Old Norse (0/7) · Old Russian (0/30) · Old Turkish (0/3) · Paraguayan Guaraní (0/2) · Paumarí (0/1) · Persian (0/14) · Polish (0/43) · Pomak (0/2) · Portuguese (0/35) · Qafar (0/2) · Romanian (0/24) · Russian (0/33) · Sanskrit (0/10) · Saya (0/2) · Scottish Gaelic (0/4) · Serbian (0/6) · Sinhala (0/2) · Skolt Sami (0/4) · Slovak (0/8) · Slovenian (0/20) · Sonha (0/3) · South Levantine Arabic (0/3) · Spanish (0/27) · Swedish (0/28) · Swedish Sign Language (0/7) · Swiss German (0/4) · Tagalog (0/8) · Tamil (0/14) · Tatar (0/2) · Telugu (0/7) · Thai (0/5) · Tswana (0/1) · Tupinambá (0/3) · Turkish (0/42) · Uighur (0/8) · Ukrainian (0/8) · Umbrian (0/2) · Upper Sorbian (0/6) · Urdu (0/9) · Urubú-Kaapor (0/3) · Veps (0/1) · Vietnamese (0/9) · Warlpiri (0/5) · Welsh (0/4) · Western Armenian (0/3) · Western Frisian (0/3) · Wolof (0/7) · Xavánte (0/2) · Xibe (0/2) · Yakut (0/2) · Yoruba (0/5) · Yue Chinese (0/6) · Yupik (0/3) · Zacatlán-Ahuacatlán-Tepetzintla Nahuatl (0/2)
Treebank Collections: All · Acquis (7) · Alpino (1) · BulTreeBank (1) · CLARIN-PL (5) · DELPH-IN (2) · GEGO (4) · GNC (2) · GeoGram (4) · HunGram (4) · ISWOC (9) · JOS (1) · Menotec (7) · Mercurius (1) · NAOB (15) · NDT (6) · NorGram (58) · NorGramBank (40) · Ordbøkene (4) · POLFIE (23) · PROIEL (10) · PaHC (2) · ParGram (11) · ParTMA (15) · Sami-open (15) · Sami-restricted (7) · Sofie (9) · TIGER (3) · TOROT (22) · Universal Dependencies 1.1 (19) · Universal Dependencies 1.2 (36) · Universal Dependencies 1.3 (53) · Universal Dependencies 1.4 (63) · Universal Dependencies 2.0 (63) · Universal Dependencies 2.1 (103) · Universal Dependencies 2.12 (245) · Universal Dependencies 2.14 (283) · Universal Dependencies 2.3 (130) · Universal Dependencies 2.5 (157) · Universal Dependencies 2.8 (200) · WolGram (3) · XPar (2)
Treebank Types: All · lfg (0/125) · constituency (4/19) · constituency-alpino (0/1) · dependency (0/48) · dependency-cg (0/1393) · dependency-tuebadz (0/1) · hpsg (0/2)
Show only Parallel Treebanks
Click on a treebank name below to proceed. All selected treebanks will be available for viewing and searching. | Show treebank descriptions
Selected Name Collection Type Sentences Words Indexed License Downloads
all | none 334 3 891
Georgian (kat) 167   1 698
kat-gego125-con (aligned) GEGO constituency 124 1 419 yes CC-BY no
kat-gego-con (aligned) GEGO constituency 43 279 yes (Accepted) no
German (deu) 167   2 193
deu-gego125-con (aligned) GEGO constituency 124 1 879 yes CC-BY no
deu-gego-con (aligned) GEGO constituency 43 314 yes (Accepted) no