OLAC Record
oai:lindat.mff.cuni.cz:11372/LRT-2614

Metadata
Title: ParCorFull: A Parallel Corpus Annotated with Full Coreference
Bibliographic Citation: http://hdl.handle.net/11372/LRT-2614
Creator: Lapshinova-Koltunski, Ekaterina; Hardmeier, Christian; Krielke, Pauline
Date (W3CDTF): 2018-05-08T12:07:36Z
Date Available: 2018-05-08T12:07:36Z
Description: ParCorFull is a parallel corpus annotated with full coreference chains that has been created to address an important problem that machine translation and other multilingual natural language processing (NLP) technologies face: the translation of coreference across languages. Our corpus contains parallel texts for the language pair English-German, two major European languages. Despite being typologically very close, these languages still have systemic differences in the realisation of coreference, and thus pose problems for multilingual coreference resolution and machine translation. Our parallel corpus covers the genres of planned speech (public lectures) and newswire. It is richly annotated for coreference in both languages, including annotation of both nominal coreference and reference to antecedents expressed as clauses, sentences and verb phrases. This resource supports research in the areas of natural language processing, contrastive linguistics and translation studies on the mechanisms involved in coreference translation in order to develop a better understanding of the phenomenon.
Identifier (URI): http://hdl.handle.net/11372/LRT-2614
Language: English; German
Language (ISO639): eng; deu
Publisher: Universität des Saarlandes; Uppsala University
Rights: Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0); http://creativecommons.org/licenses/by-nc-nd/4.0/
Subject: parallel corpus; annotated corpus; coreference; anaphora resolution
Type: corpus
Type (DCMI): Text
Type (OLAC): primary_text

OLAC Info

Archive: LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description: http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord: OAI-PMH request for OLAC format
GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:lindat.mff.cuni.cz:11372/LRT-2614
DateStamp: 2021-06-29
GetRecord: OAI-PMH request for simple DC format

Search Info

Citation: Lapshinova-Koltunski, Ekaterina; Hardmeier, Christian; Krielke, Pauline. 2018. ParCorFull: A Parallel Corpus Annotated with Full Coreference. Universität des Saarlandes.
Terms: area_Europe country_DE country_GB dcmi_Text iso639_deu iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11372/LRT-2614
Up-to-date as of: Thu Oct 5 0:40:51 EDT 2023