INESS
Treebank Selection

Select a set of treebanks to work with.
Languages: All · Abaza (2/3) · Abkhazian (1/3) · Afrikaans (4/7) · Akkadian (6/10) · Akuntsu (2/4) · Albanian (2/3) · Amharic (4/6) · Ancient Greek (to 1453) (13/27) · Ancient Hebrew (2/3) · Apurinã (2/4) · Arabic (14/25) · Armenian (6/9) · Assyrian Neo-Aramaic (3/5) · Azerbaijani (1/2) · Bambara (4/6) · Basque (6/12) · Bavarian (1/2) · Beja (2/3) · Belarusian (4/7) · Bengali (2/3) · Bhojpuri (3/5) · Borôro (2/3) · Breton (4/6) · Bulgarian (6/13) · Buriat (4/7) · Catalan (5/10) · Cebuano (2/3) · Chinese (24/43) · Chukot (2/4) · Church Slavic (6/14) · Classical Armenian (1/3) · Coptic (5/8) · Croatian (6/12) · Czech (25/48) · Danish (7/14) · Dutch (11/22) · Egyptian (Ancient) (1/2) · Emerillon (2/3) · English (40/80) · Erzya (4/6) · Estonian (10/18) · Faroese (6/11) · Finnish (19/37) · French (29/54) · Galician (11/21) · Gbaya (Central African Republic) (0/1) · Georgian (1/13) · German (17/42) · Gheg Albanian (2/3) · Gothic (6/12) · Guajajára (2/4) · Gujarati (1/2) · Gweno (0/1) · Haitian (1/2) · Hausa (2/4) · Hebrew (8/16) · Hindi (10/18) · Hittite (2/3) · Hungarian (6/16) · Icelandic (9/18) · Indonesian (12/24) · Irish (10/19) · Italian (31/57) · Jamamadí (2/3) · Japanese (21/34) · Javanese (2/3) · K'iche' (2/4) · Kangri (2/4) · Karelian (3/5) · Karo (Ethiopia) (2/3) · Kazakh (5/10) · Khunsari (2/4) · Kirghiz (3/5) · Komi (8/12) · Komi-Permyak (3/5) · Korean (12/20) · Latgalian (1/2) · Latin (23/47) · Latvian (7/13) · Ligurian (2/3) · Literary Chinese (3/5) · Lithuanian (7/12) · Livvi (3/5) · Low German (2/4) · Luxembourgish (1/2) · Macedonian (1/2) · Makuráp (2/4) · Malayalam (2/3) · Maltese (4/6) · Manx (2/4) · Marathi (4/7) · Mbyá Guaraní (6/10) · Middle French (ca. 1400-1600) (1/2) · Modern Greek (1453-) (9/19) · Moksha (3/5) · Mundurukú (2/4) · (0/1) · (1/2) · (0/1) · (0/1) · (0/1) · (0/1) · Nayini (2/4) · Neapolitan (2/3) · Nhengatu (2/3) · Nigerian Pidgin (4/6) · Northern Kurdish (4/7) · Northern Sami (4/32) · Norwegian (0/5) · Norwegian Bokmål (6/66) · Norwegian Nynorsk (6/28) · Old English (ca. 450-1100) (0/5) · Old French (842-ca. 1400) (4/7) · Old Irish (to 900) (4/6) · Old Norse (0/7) · Old Russian (8/34) · Old Turkish (2/4) · Paraguayan Guaraní (2/3) · Paumarí (1/2) · Pech (0/1) · Persian (8/16) · Phrygian (0/1) · Polish (13/46) · Pomak (2/3) · Portuguese (21/42) · Pushto (0/1) · Qafar (2/3) · Romanian (16/29) · Russian (21/38) · Sanskrit (7/12) · Saya (2/3) · Scottish Gaelic (3/5) · Serbian (4/7) · Sinhala (2/3) · Skolt Sami (3/5) · Slovak (5/9) · Slovenian (11/22) · Sonha (2/4) · South Levantine Arabic (2/4) · Spanish (16/31) · Spanish Sign Language (0/1) · Swedish (16/31) · Swedish Sign Language (5/8) · Swiss German (3/5) · Tagalog (6/10) · Tamil (8/16) · Tatar (2/3) · Telugu (5/9) · Thai (4/6) · Tswana (1/2) · Tupinambá (2/4) · Turkish (28/54) · Uighur (5/9) · Ukrainian (5/8) · Umbrian (2/3) · Upper Sorbian (4/7) · Urdu (4/10) · Urubú-Kaapor (2/4) · Uzbek (0/1) · Veps (1/2) · Vietnamese (6/11) · Warlpiri (4/6) · Welsh (3/5) · Western Armenian (2/4) · Western Frisian (2/4) · Wolof (3/8) · Xavánte (2/3) · Xibe (2/3) · Yakut (2/3) · Yoruba (4/6) · Yue Chinese (4/7) · Yupik (2/4) · Zacatlán-Ahuacatlán-Tepetzintla Nahuatl (2/3)
Treebank Collections: All · Acquis (6/7) · Alpino (0/1) · BulTreeBank (1) · CLARIN-PL (0/5) · DELPH-IN (0/2) · GEGO (4) · GNC (2) · GeoGram (0/4) · HunGram (0/4) · ISWOC (0/9) · JOS (1) · Menotec (0/7) · Mercurius (1) · NAOB (0/15) · NDT (2/6) · NorGram (0/58) · NorGramBank (0/40) · Ordbøkene (0/8) · POLFIE (0/23) · PROIEL (0/10) · PaHC (2) · ParGram (0/11) · ParTMA (0/15) · Sami-open (15) · Sami-restricted (7) · Sofie (7/9) · TIGER (1/3) · TOROT (0/22) · Universal Dependencies 1.1 (19) · Universal Dependencies 1.2 (36) · Universal Dependencies 1.3 (53) · Universal Dependencies 1.4 (63) · Universal Dependencies 2.0 (63) · Universal Dependencies 2.1 (103) · Universal Dependencies 2.12 (245) · Universal Dependencies 2.14 (283) · Universal Dependencies 2.15 (297) · Universal Dependencies 2.3 (130) · Universal Dependencies 2.5 (157) · Universal Dependencies 2.8 (200) · WolGram (0/3) · XPar (0/2)
Treebank Types: All · lfg (15/129) · constituency (5/19) · constituency-alpino (0/1) · dependency (0/48) · dependency-cg (916/1691) · dependency-tuebadz (0/1) · hpsg (2)
Click on a treebank name below to proceed. All selected treebanks will be available for viewing and searching.
| Name | Collection | Type | Sentences | Words | Indexed | License | Downloads |
|---|---|---|---|---|---|---|---|
| Total | | | 223 995 | 3 249 203 | | | |
| Abaza (abq) | | | 0 | 0 | | | |
| Abkhazian (abk) | | | 0 | 0 | | | |
| Russian (rus) | | | 223 995 | 3 249 203 | | | |
| rus-ud-1.4-dep | Universal Dependencies 1.4 | dependency-cg | 5 030 | 85 189 | no | unspecified | no |
| rus-ud-gsd-2.3-dep | Universal Dependencies 2.3 | dependency-cg | 5 030 | 85 189 | no | unspecified | no |
| rus-ud-gsd-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 5 030 | 83 783 | no | (Accepted) | no |
| rus-ud-pud-2.3-dep | Universal Dependencies 2.3 | dependency-cg | 1 000 | 16 505 | no | unspecified | no |
| rus-ud-pud-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 1 000 | 16 490 | no | (Accepted) | no |
| rus-ud-rnc-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 604 | 15 823 | no | (Accepted) | no |
| rus-ud-syn-tag-rus-1.4-dep | Universal Dependencies 1.4 | dependency-cg | 60 551 | 896 887 | no | unspecified | no |
| rus-ud-syn-tag-rus-2.3-dep | Universal Dependencies 2.3 | dependency-cg | 61 889 | 925 552 | no | unspecified | no |
| rus-ud-syn-tag-rus-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 61 889 | 924 103 | no | (Accepted) | no |
| rus-ud-taiga-2.3-dep | Universal Dependencies 2.3 | dependency-cg | 1 764 | 17 076 | no | unspecified | no |
| rus-ud-taiga-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 3 264 | 32 830 | no | (Accepted) | no |
| rus-ud-torot-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 16 944 | 149 776 | no | (Accepted) | no |
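
The sentence and word totals shown for Russian (rus) can be checked against the individual treebank rows. The following is a minimal sketch, assuming the counts are copied by hand from the listing above. The names `ROWS`, `parse_count` and `totals` are hypothetical helpers introduced for illustration and are not part of INESS.

```python
# Rows copied from the Russian (rus) listing: (name, sentences, words).
# Counts keep the space-grouped form used on the page.
ROWS = [
    ("rus-ud-1.4-dep", "5 030", "85 189"),
    ("rus-ud-gsd-2.3-dep", "5 030", "85 189"),
    ("rus-ud-gsd-2.5-dep", "5 030", "83 783"),
    ("rus-ud-pud-2.3-dep", "1 000", "16 505"),
    ("rus-ud-pud-2.5-dep", "1 000", "16 490"),
    ("rus-ud-rnc-2.5-dep", "604", "15 823"),
    ("rus-ud-syn-tag-rus-1.4-dep", "60 551", "896 887"),
    ("rus-ud-syn-tag-rus-2.3-dep", "61 889", "925 552"),
    ("rus-ud-syn-tag-rus-2.5-dep", "61 889", "924 103"),
    ("rus-ud-taiga-2.3-dep", "1 764", "17 076"),
    ("rus-ud-taiga-2.5-dep", "3 264", "32 830"),
    ("rus-ud-torot-2.5-dep", "16 944", "149 776"),
]


def parse_count(text: str) -> int:
    """Turn a space-grouped count such as '223 995' into an int."""
    # str.split() with no argument splits on any whitespace,
    # so regular and non-breaking spaces are both removed.
    return int("".join(text.split()))


def totals(rows):
    """Return (total sentences, total words) over all rows."""
    sentences = sum(parse_count(s) for _, s, _ in rows)
    words = sum(parse_count(w) for _, _, w in rows)
    return sentences, words


total_sentences, total_words = totals(ROWS)
print(total_sentences, total_words)  # 223995 3249203
```

The sums agree with the 223 995 sentences and 3 249 203 words reported for the selection. Several rows are different Universal Dependencies releases of the same corpus, so the totals count overlapping material more than once.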