OLAC Record oai:lindat.mff.cuni.cz:11234/1-3308 |
Metadata | ||
Title: | FAUST 0.5 | |
Bibliographic Citation: | http://hdl.handle.net/11234/1-3308 | |
Creator: | Hajič, Jan | |
Mareček, David | ||
Fučíková, Eva | ||
Cinková, Silvie | ||
Štěpánek, Jan | ||
Mikulová, Marie | ||
Date (W3CDTF): | 2021-10-15T13:55:25Z | |
Date Available: | 2021-10-15T13:55:25Z | |
Description: | Syntactic (including deep-syntactic - tectogrammatical) annotation of user-generated noisy sentences. The annotation was made on Czech-English and English-Czech Faust Dev/Test sets. The English data includes manual annotations of English reference translations of Czech source texts. This texts were translated independently by two translators. After some necessary cleanings, 1000 segments were randomly selected for manual annotation. Both the reference translations were annotated, which means 2000 annotated segments in total. The Czech data includes manual annotations of Czech reference translations of English source texts. This texts were translated independently by three translators. After some necessary cleanings, 1000 segments were randomly selected for manual annotation. All three reference translations were annotated, which means 3000 annotated segments in total. Faust is part of PDT-C 1.0 (http://hdl.handle.net/11234/1-3185). | |
Identifier (URI): | http://hdl.handle.net/11234/1-3308 | |
Language: | English | |
Czech | ||
Language (ISO639): | eng | |
ces | ||
Publisher: | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) | |
Rights: | Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) | |
http://creativecommons.org/licenses/by-nc/4.0/ | ||
Subject: | tectogrammatics | |
treebank | ||
parallel corpus | ||
noisy texts | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11234/1-3308 | |
DateStamp: | 2021-10-15 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Hajič, Jan; Mareček, David; Fučíková, Eva; Cinková, Silvie; Štěpánek, Jan; Mikulová, Marie. 2021. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL). | |
Terms: | area_Europe country_CZ country_GB dcmi_Text iso639_ces iso639_eng olac_primary_text |