OLAC Record oai:lindat.mff.cuni.cz:11234/1-1458 |
Metadata | ||
Title: | Czech-English Parallel Corpus 1.0 (CzEng 1.0) | |
Bibliographic Citation: | http://hdl.handle.net/11234/1-1458 | |
Creator: | Bojar, Ondřej | |
Žabokrtský, Zdeněk | ||
Dušek, Ondřej | ||
Galuščáková, Petra | ||
Majliš, Martin | ||
Mareček, David | ||
Maršík, Jiří | ||
Novák, Michal | ||
Popel, Martin | ||
Tamchyna, Aleš | ||
Date (W3CDTF): | 2014-11-10T08:23:49Z | |
Date Available: | 2014-11-10T08:23:49Z | |
Description: | CzEng 1.0 is the fourth release of a sentence-parallel Czech-English corpus compiled at the Institute of Formal and Applied Linguistics (ÚFAL). It is freely available for non-commercial research purposes. CzEng 1.0 contains 15 million parallel sentences (233 million English and 206 million Czech tokens) from seven different types of sources, automatically annotated at the surface and deep (a- and t-) layers of syntactic representation. | |
EuroMatrix Plus (FP7-ICT-2007-3-231720 of the EU and 7E09003+7E11051 of the Ministry of Education, Youth and Sports of the Czech Republic), Faust (FP7-ICT-2009-4-247762 of the EU and 7E11041 of the Ministry of Education, Youth and Sports of the Czech Republic), GAČR P406/10/P259, GAUK 116310, GAUK 4226/2011 | ||
Identifier (URI): | http://hdl.handle.net/11234/1-1458 | |
Language: | Czech | |
English | ||
Language (ISO639): | ces | |
eng | ||
Publisher: | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) | |
Replaces (URI): | http://hdl.handle.net/11858/00-097C-0000-0001-4916-9 | |
Rights: | Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) | |
http://creativecommons.org/licenses/by-nc-sa/3.0/ | ||
Subject: | corpus | |
parallel corpus | ||
treebank | ||
alignment | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info | ||
Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info | ||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11234/1-1458 | |
DateStamp: | 2021-06-29 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Bojar, Ondřej; Žabokrtský, Zdeněk; Dušek, Ondřej; Galuščáková, Petra; Majliš, Martin; Mareček, David; Maršík, Jiří; Novák, Michal; Popel, Martin; Tamchyna, Aleš. 2014. Czech-English Parallel Corpus 1.0 (CzEng 1.0). Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL). | |
Terms: | area_Europe country_CZ country_GB dcmi_Text iso639_ces iso639_eng olac_primary_text |