OLAC Record
oai:lindat.mff.cuni.cz:11372/LRT-2725

Metadata
Title:Test Data EN-DE MT_PBSMT APE Shared Task WMT18
Bibliographic Citation:http://hdl.handle.net/11372/LRT-2725
Creator:Turchi, Marco
Negri, Matteo
Chatterjee, Rajen
Date (W3CDTF):2018-05-03T06:43:41Z
Date Available:2018-05-03T06:43:41Z
Description:Test data for the WMT 2018 Automatic post-editing task. They consist in English-German pairs (source and target) belonging to the information technology domain and already tokenized. Test set contains 2,000 pairs. A phrase-based machine translation system has been used to generate the target segments. This test set is sampled from the same dataset used for the 2016 and 2017 APE shared task editions. All data is provided by the EU project QT21 (http://www.qt21.eu/).
Identifier (URI):http://hdl.handle.net/11372/LRT-2725
Language:English
German
Language (ISO639):eng
deu
Publisher:Fondazione Bruno Kessler, Trento, Italy
Rights:AGREEMENT ON THE USE OF DATA IN QT21 APE Task
https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21
Subject:machine translation
shared task
automatic post-editing
post-editing
phrase-based MT
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11372/LRT-2725
DateStamp:  2021-06-29
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Turchi, Marco; Negri, Matteo; Chatterjee, Rajen. 2018. Fondazione Bruno Kessler, Trento, Italy.
Terms: area_Europe country_DE country_GB dcmi_Text iso639_deu iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11372/LRT-2725
Up-to-date as of: Thu Oct 5 0:40:53 EDT 2023