OLAC Record oai:lindat.mff.cuni.cz:11234/1-2899 |
Metadata | ||
Title: | CoNLL 2017 and 2018 Shared Task Blind and Preprocessed Test Data | |
Bibliographic Citation: | http://hdl.handle.net/11234/1-2899 | |
Creator: | Zeman, Daniel | |
Straka, Milan | ||
Date (W3CDTF): | 2018-11-28T13:44:33Z | |
Date Available: | 2018-11-28T13:44:33Z | |
Description: | CoNLL 2017 and 2018 shared tasks: Multilingual Parsing from Raw Text to Universal Dependencies This package contains the test data in the form in which they ware presented to the participating systems: raw text files and files preprocessed by UDPipe. The metadata.json files contain lists of files to process and to output; README files in the respective folders describe the syntax of metadata.json. For full training, development and gold standard test data, see Universal Dependencies 2.0 (CoNLL 2017) Universal Dependencies 2.2 (CoNLL 2018) See the download links at http://universaldependencies.org/. For more information on the shared tasks, see http://universaldependencies.org/conll17/ http://universaldependencies.org/conll18/ Contents: conll17-ud-test-2017-05-09 ... CoNLL 2017 test data conll18-ud-test-2018-05-06 ... CoNLL 2018 test data conll18-ud-test-2018-05-06-for-conll17 ... CoNLL 2018 test data with metadata and filenames modified so that it is digestible by the 2017 systems. | |
Identifier (URI): | http://hdl.handle.net/11234/1-2899 | |
Language: | Afrikaans | |
Arabic | ||
Breton | ||
Bulgarian | ||
Russia Buriat | ||
Catalan | ||
Czech | ||
Church Slavic | ||
Danish | ||
German | ||
Modern Greek (1453-) | ||
English | ||
Estonian | ||
Basque | ||
Faroese | ||
Persian | ||
Finnish | ||
French | ||
Old French (842-ca. 1400) | ||
Irish | ||
Galician | ||
Gothic | ||
Ancient Greek (to 1453) | ||
Hebrew | ||
Hindi | ||
Croatian | ||
Upper Sorbian | ||
Hungarian | ||
Armenian | ||
Indonesian | ||
Italian | ||
Japanese | ||
Kazakh | ||
Northern Kurdish | ||
Korean | ||
Latin | ||
Latvian | ||
Dutch | ||
Norwegian | ||
Nigerian Pidgin | ||
Polish | ||
Portuguese | ||
Romanian | ||
Russian | ||
Slovak | ||
Slovenian | ||
Northern Sami | ||
Spanish | ||
Serbian | ||
Swedish | ||
Thai | ||
Turkish | ||
Uighur | ||
Ukrainian | ||
Urdu | ||
Vietnamese | ||
Chinese | ||
Language (ISO639): | afr | |
ara | ||
bre | ||
bul | ||
bxr | ||
cat | ||
ces | ||
chu | ||
dan | ||
deu | ||
ell | ||
eng | ||
est | ||
eus | ||
fao | ||
fas | ||
fin | ||
fra | ||
fro | ||
gle | ||
glg | ||
got | ||
grc | ||
heb | ||
hin | ||
hrv | ||
hsb | ||
hun | ||
hye | ||
ind | ||
ita | ||
jpn | ||
kaz | ||
kmr | ||
kor | ||
lat | ||
lav | ||
nld | ||
nor | ||
pcm | ||
pol | ||
por | ||
ron | ||
rus | ||
slk | ||
slv | ||
sme | ||
spa | ||
srp | ||
swe | ||
tha | ||
tur | ||
uig | ||
ukr | ||
urd | ||
vie | ||
zho | ||
Publisher: | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) | |
Rights: | Licence Universal Dependencies v2.2 | |
https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2 | ||
Subject: | tokenization | |
word segmentation | ||
morphology | ||
tagging | ||
syntax | ||
parsing | ||
universal dependencies | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11234/1-2899 | |
DateStamp: | 2021-06-29 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Zeman, Daniel; Straka, Milan. 2018. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL). | |
Terms: | area_Africa area_Asia area_Europe country_AM country_BG country_CN country_CZ country_DE country_DK country_ES country_FI country_FR country_GB country_GR country_HR country_HU country_ID country_IE country_IL country_IN country_IT country_JP country_KR country_KZ country_NG country_NL country_NO country_PK country_PL country_PT country_RO country_RS country_RU country_SE country_SI country_SK country_TH country_TR country_UA country_VA country_VN country_ZA dcmi_Text iso639_afr iso639_ara iso639_bre iso639_bul iso639_bxr iso639_cat iso639_ces iso639_chu iso639_dan iso639_deu iso639_ell iso639_eng iso639_est iso639_eus iso639_fao iso639_fas iso639_fin iso639_fra iso639_fro iso639_gle iso639_glg iso639_got iso639_grc iso639_heb iso639_hin iso639_hrv iso639_hsb iso639_hun iso639_hye iso639_ind iso639_ita iso639_jpn iso639_kaz iso639_kmr iso639_kor iso639_lat iso639_lav iso639_nld iso639_nor iso639_pcm iso639_pol iso639_por iso639_ron iso639_rus iso639_slk iso639_slv iso639_sme iso639_spa iso639_srp iso639_swe iso639_tha iso639_tur iso639_uig iso639_ukr iso639_urd iso639_vie iso639_zho olac_primary_text |