INESS-logo
Treebank Selection

Treebanks

Tools


Select a set of treebanks to work with. ?
Languages: All · Afrikaans (0/1) · Ancient Greek (to 1453) (2/13) · Arabic (1/7) · Basque (1/6) · Belarusian (0/1) · Bulgarian (2/7) · Buriat (0/1) · Catalan (0/4) · Chinese (0/7) · Church Slavic (1/8) · Classical Armenian (0/1) · Coptic (0/2) · Croatian (1/7) · Czech (1/16) · Danish (1/7) · Dutch (1/9) · English (1/21) · Estonian (1/6) · Finnish (2/13) · French (1/12) · Galician (0/7) · Georgian (2/5) · German (1/16) · Gothic (1/6) · Hebrew (1/6) · Hindi (1/6) · Hungarian (1/8) · Icelandic (0/1) · Indonesian (1/8) · Irish (1/6) · Italian (1/11) · Japanese (0/4) · Kazakh (0/4) · Korean (0/1) · Latin (3/19) · Latvian (0/4) · Lithuanian (0/1) · Marathi (0/1) · Modern Greek (1453-) (1/7) · (1) · Northern Kurdish (0/1) · Northern Sami (15/16) · Norwegian Bokmål (1/9) · Norwegian Nynorsk (0/3) · Old English (ca. 450-1100) (0/5) · Old French (842-ca. 1400) (0/1) · Old Norse (0/4) · Old Russian (0/20) · Persian (1/6) · Polish (1/11) · Portuguese (1/15) · Romanian (1/6) · Russian (0/10) · Sanskrit (0/2) · Serbian (0/1) · Slovak (0/3) · Slovenian (2/10) · Spanish (0/10) · Swedish (1/12) · Swedish Sign Language (0/2) · Tamil (1/4) · Telugu (0/1) · Turkish (0/6) · Uighur (0/3) · Ukrainian (0/3) · Upper Sorbian (0/1) · Urdu (0/4) · Vietnamese (0/3) · Wolof (0/1) · Yue Chinese (0/1)
Treebank Collections: All · BulTreeBank (0/1) · CLARIN-PL (0/3) · GEGO (0/4) · GeoGram (2) · HunGram (0/2) · ISWOC (0/9) · JOS (0/1) · Menotec (0/4) · Mercurius (0/1) · NorGram (0/3) · POLFIE (0/5) · PROIEL (0/10) · ParGram (3/12) · ParTMA (4/14) · Sami-open (15) · Sofie (1/9) · TOROT (0/22) · TiGer (0/3) · Universal Dependencies 1.1 (6/19) · Universal Dependencies 1.2 (12/36) · Universal Dependencies 1.3 (17/53) · Universal Dependencies 1.4 (20/63) · Universal Dependencies 2.0 (23/63) · Universal Dependencies 2.1 (34/103) · WolGram (1)
Treebank Types: All · lfg (1/3) · constituency (2/13) · dependency (0/45) · dependency-cg (27/354)
Show only Parallel Treebanks

Show custom treebank:
Click on a treebank name below to proceed. All selected treebanks will be available for viewing and searching. | Show treebank descriptions
Selected Name Collection Type Sentences Words Indexed License Downloads
all | none 923 082 11 061 236
Arabic (ara) 7 664   265 943
ara-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 7 664 265 943 yes unspecified no
Czech (ces) 87 913   1 295 498
ces-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 87 913 1 295 498 yes unspecified no
Church Slavic (chu) 6 346   57 507
chu-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 6 346 57 507 yes unspecified no
Danish (dan) 5 614   89 362
dan-jrc-acquis-dep (aligned) Acquis dependency-cg 102 1 799 yes CC-BY ConLL
dan-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 5 512 87 563 yes unspecified no
(esp) 16 013   377 286
esp-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 16 013 377 286 yes unspecified no
Estonian (est) 1 315   8 033
est-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 1 315 8 033 yes unspecified no
French (fra) 16 446   345 854
fra-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 16 446 345 854 yes unspecified no
Irish (gle) 1 020   21 780
gle-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 1 020 21 780 yes unspecified no
Ancient Greek (to 1453) (grc) 32 854   427 286
grc-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 16 221 220 323 yes unspecified no
grc-ud-proiel-1.2-dep Universal Dependencies 1.2 dependency-cg 16 633 206 963 yes unspecified no
Indonesian (ind) 5 593   105 469
ind-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 5 593 105 469 yes unspecified no
Italian (ita) 12 677   222 775
ita-ud-1.2-dep Universal Dependencies 1.2 dependency-cg 12 677 222 775 yes unspecified no
Georgian (kat) 1 077   10 146
kat-pargram (aligned) GeoGram, ParGram lfg 52 231 yes CC-BY (Accepted) no
kat-sofie (aligned) GeoGram, Sofie lfg 1 025 9 915 yes unspecified no
Northern Sami (sme) 728 550   7 834 297
sme-admin Sami-open dependency-cg 222 475 2 229 099 yes (Accepted) no
sme-dan-facta Sami-open dependency-cg 114 2 054 yes (Accepted) no
sme-eng-facta Sami-open dependency-cg 49 749 yes (Accepted) no
sme-facta Sami-open dependency-cg 79 998 827 491 yes (Accepted) no
sme-fin-admin Sami-open dependency-cg 1 185 8 819 yes (Accepted) no
sme-fin-facta Sami-open dependency-cg 584 5 675 yes (Accepted) no
sme-laws Sami-open dependency-cg 24 627 327 466 yes (Accepted) no
sme-nno-admin Sami-open dependency-cg 3 151 31 064 yes (Accepted) no
sme-nno-facta Sami-open dependency-cg 4 866 55 164 yes (Accepted) no
sme-nob-admin Sami-open dependency-cg 326 382 3 563 954 yes (Accepted) no
sme-nob-facta Sami-open dependency-cg 32 510 317 407 yes (Accepted) no
sme-nob-laws Sami-open dependency-cg 32 005 459 027 yes (Accepted) no
sme-no-facta Sami-open dependency-cg 22 288 yes (Accepted) no
sme-sme-admin Sami-open dependency-cg 550 5 531 yes (Accepted) no
sme-sme-facta Sami-open dependency-cg 32 509 yes (Accepted) no

Design & implementation: Paul Meurer, CLARINO Bergen Centre, 2019