INESS
Treebank Selection

Select a set of treebanks to work with.
Languages: All · Abaza (0/3) · Abkhazian (0/3) · Afrikaans (1/7) · Akkadian (1/10) · Akuntsu (0/4) · Albanian (0/3) · Amharic (1/6) · Ancient Greek (to 1453) (6/27) · Ancient Hebrew (0/3) · Apurinã (0/4) · Arabic (5/25) · Armenian (1/9) · Assyrian Neo-Aramaic (1/5) · Azerbaijani (0/2) · Bambara (1/6) · Basque (3/12) · Bavarian (0/2) · Beja (0/3) · Belarusian (1/7) · Bengali (0/3) · Bhojpuri (1/5) · Borôro (0/3) · Breton (1/6) · Bulgarian (3/13) · Buriat (1/7) · Catalan (3/10) · Cebuano (0/3) · Chinese (8/43) · Chukot (0/4) · Church Slavic (3/14) · Classical Armenian (0/3) · Coptic (2/8) · Croatian (3/12) · Czech (11/48) · Danish (3/14) · Dutch (6/22) · Egyptian (Ancient) (0/2) · Emerillon (0/3) · English (14/80) · Erzya (1/6) · Estonian (4/18) · Faroese (1/11) · Finnish (7/37) · French (9/54) · Galician (5/21) · Gbaya (Central African Republic) (0/1) · Georgian (0/13) · German (7/42) · Gheg Albanian (0/3) · Gothic (3/12) · Guajajára (0/4) · Gujarati (0/2) · Gweno (0/1) · Haitian (0/2) · Hausa (0/4) · Hebrew (3/16) · Hindi (4/18) · Hittite (0/3) · Hungarian (3/16) · Icelandic (0/18) · Indonesian (4/24) · Irish (3/19) · Italian (8/57) · Jamamadí (0/3) · Japanese (5/34) · Javanese (0/3) · K'iche' (0/4) · Kangri (0/4) · Karelian (1/5) · Karo (Ethiopia) (0/3) · Kazakh (3/10) · Khunsari (0/4) · Kirghiz (0/5) · Komi (2/12) · Komi-Permyak (1/5) · Korean (3/20) · Latgalian (0/2) · Latin (9/47) · Latvian (3/13) · Ligurian (0/3) · Literary Chinese (0/5) · Lithuanian (2/12) · Livvi (1/5) · Low German (0/4) · Luxembourgish (0/2) · Macedonian (0/2) · Makuráp (0/4) · Malayalam (0/3) · Maltese (1/6) · Manx (0/4) · Marathi (1/7) · Mbyá Guaraní (2/10) · Middle French (ca. 1400-1600) (0/2) · Modern Greek (1453-) (3/19) · Moksha (1/5) · Mundurukú (0/4) · (0/1) · (0/2) · (0/1) · (0/1) · (0/1) · (0/1) · Nayini (0/4) · Neapolitan (0/3) · Nhengatu (0/3) · Nigerian Pidgin (1/6) · Northern Kurdish (1/7) · Northern Sami (1/32) · Norwegian (5) · Norwegian Bokmål (35/66) · Norwegian Nynorsk (8/28) · Old English (ca. 450-1100) (5) · Old French (842-ca. 1400) (2/7) · Old Irish (to 900) (0/6) · Old Norse (0/7) · Old Russian (0/34) · Old Turkish (0/4) · Paraguayan Guaraní (0/3) · Paumarí (0/2) · Pech (0/1) · Persian (3/16) · Phrygian (0/1) · Polish (5/46) · Pomak (0/3) · Portuguese (11/42) · Pushto (0/1) · Qafar (0/3) · Romanian (5/29) · Russian (10/38) · Sanskrit (2/12) · Saya (0/3) · Scottish Gaelic (1/5) · Serbian (1/7) · Sinhala (0/3) · Skolt Sami (1/5) · Slovak (2/9) · Slovenian (6/22) · Sonha (0/4) · South Levantine Arabic (0/4) · Spanish (7/31) · Spanish Sign Language (0/1) · Swedish (7/31) · Swedish Sign Language (2/8) · Swiss German (1/5) · Tagalog (1/10) · Tamil (3/16) · Tatar (0/3) · Telugu (1/9) · Thai (1/6) · Tswana (0/2) · Tupinambá (0/4) · Turkish (5/54) · Uighur (2/9) · Ukrainian (2/8) · Umbrian (0/3) · Upper Sorbian (1/7) · Urdu (1/10) · Urubú-Kaapor (0/4) · Uzbek (0/1) · Veps (0/2) · Vietnamese (2/11) · Warlpiri (1/6) · Welsh (1/5) · Western Armenian (0/4) · Western Frisian (0/4) · Wolof (1/8) · Xavánte (0/3) · Xibe (0/3) · Yakut (0/3) · Yoruba (1/6) · Yue Chinese (1/7) · Yupik (0/4) · Zacatlán-Ahuacatlán-Tepetzintla Nahuatl (0/3)
Treebank Collections: All · Acquis (7) · Alpino (1) · BulTreeBank (1) · CLARIN-PL (5) · DELPH-IN (2) · GEGO (4) · GNC (2) · GeoGram (4) · HunGram (4) · ISWOC (9) · JOS (1) · Menotec (7) · Mercurius (1) · NAOB (15) · NDT (6) · NorGram (58) · NorGramBank (40) · Ordbøkene (8) · POLFIE (23) · PROIEL (10) · PaHC (2) · ParGram (11) · ParTMA (15) · Sami-open (15) · Sami-restricted (7) · Sofie (9) · TIGER (3) · TOROT (22) · Universal Dependencies 1.1 (19) · Universal Dependencies 1.2 (36) · Universal Dependencies 1.3 (53) · Universal Dependencies 1.4 (63) · Universal Dependencies 2.0 (63) · Universal Dependencies 2.1 (103) · Universal Dependencies 2.12 (245) · Universal Dependencies 2.14 (283) · Universal Dependencies 2.15 (297) · Universal Dependencies 2.3 (130) · Universal Dependencies 2.5 (157) · Universal Dependencies 2.8 (200) · WolGram (3) · XPar (2)
Treebank Types: All · lfg (43/129) · constituency (1/19) · constituency-alpino (0/1) · dependency (9/48) · dependency-cg (273/1691) · dependency-tuebadz (0/1) · hpsg (0/2)
Click on a treebank name below to proceed. All selected treebanks will be available for viewing and searching.
| Name | Collection | Type | Sentences | Words | Indexed | License | Downloads |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Total | | | 150 489 | 2 150 074 | | | |
| Afrikaans (afr) | | | 1 934 | 44 799 | | | |
| afr-ud-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 1 934 | 44 799 | no | (Accepted) | no |
| Akkadian (akk) | | | 101 | 1 852 | | | |
| akk-ud-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 101 | 1 852 | no | (Accepted) | no |
| Amharic (amh) | | | 1 074 | 5 105 | | | |
| amh-ud-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 1 074 | 5 105 | no | (Accepted) | no |
| Ancient Greek (to 1453) (grc) | | | 96 707 | 1 252 264 | | | |
| grc-ud-1.3-dep | Universal Dependencies 1.3 | dependency-cg | 16 221 | 220 323 | no | unspecified | no |
| grc-ud-1.4-dep | Universal Dependencies 1.4 | dependency-cg | 16 221 | 220 323 | no | unspecified | no |
| grc-ud-perseus-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 13 919 | 183 693 | no | (Accepted) | no |
| grc-ud-proiel-1.3-dep | Universal Dependencies 1.3 | dependency-cg | 16 633 | 206 963 | no | unspecified | no |
| grc-ud-proiel-1.4-dep | Universal Dependencies 1.4 | dependency-cg | 16 633 | 206 963 | no | unspecified | no |
| grc-ud-proiel-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 17 080 | 213 999 | no | (Accepted) | no |
| Arabic (ara) | | | 43 730 | 714 854 | | | |
| ara-ud-1.3-dep | Universal Dependencies 1.3 | dependency-cg | 7 664 | 225 517 | no | unspecified | no |
| ara-ud-1.4-dep | Universal Dependencies 1.4 | dependency-cg | 7 664 | 225 517 | no | unspecified | no |
| ara-ud-nyuad-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 19 738 | 19 738 | no | (Accepted) | no |
| ara-ud-padt-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 7 664 | 225 517 | no | (Accepted) | no |
| ara-ud-pud-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 1 000 | 18 565 | no | (Accepted) | no |
| Armenian (hye) | | | 2 502 | 45 040 | | | |
| hye-ud-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 2 502 | 45 040 | no | (Accepted) | no |
| Assyrian Neo-Aramaic (aii) | | | 57 | 388 | | | |
| aii-ud-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 57 | 388 | no | (Accepted) | no |
| Serbian (srp) | | | 4 384 | 85 772 | | | |
| srp-ud-2.5-dep | Universal Dependencies 2.5 | dependency-cg | 4 384 | 85 772 | no | (Accepted) | no |