OLAC Record
oai:lindat.mff.cuni.cz:11234/1-5672

Metadata
Title:CorPipe 24 Multilingual CorefUD 1.2 Model (corpipe24-corefud1.2-240906)
Bibliographic Citation:http://hdl.handle.net/11234/1-5672
Creator:Straka, Milan
Date (W3CDTF):2024-10-07T15:30:49Z
Date Available:2024-10-07T15:30:49Z
Description:The `corpipe24-corefud1.2-240906` is a `mT5-large`-based multilingual model for coreference resolution usable in CorPipe 24 (https://github.com/ufal/crac2024-corpipe). It is released under the CC BY-NC-SA 4.0 license. The model is language agnostic (no corpus id on input), so it can be in theory used to predict coreference in any `mT5` language. This model jointly predicts also the empty nodes needed for zero coreference. The paper introducing this model also presents an alternative two-stage approach first predicting empty nodes (via https://www.kaggle.com/models/ufal-mff/crac2024_zero_nodes_baseline/) and then performing coreference resolution (via http://hdl.handle.net/11234/1-5673), which is circa twice as slow but slightly better.
Identifier (URI):http://hdl.handle.net/11234/1-5672
Language:Catalan
Czech
German
English
Spanish
French
Hungarian
Lithuanian
Norwegian Bokmål
Norwegian Nynorsk
Polish
Russian
Turkish
Church Slavic
Ancient Greek (to 1453)
Ancient Hebrew
Language (ISO639):cat
ces
deu
eng
spa
fra
hun
lit
nob
nno
pol
rus
tur
chu
grc
hbo
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Replaces (URI):http://hdl.handle.net/11234/1-5369
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:coreference resolution
CorPipe
CorefUD
Type:toolService
Type (DCMI):Software

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-5672
DateStamp:  2024-10-07
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Straka, Milan. 2024. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Asia area_Europe country_CZ country_DE country_ES country_FR country_GB country_GR country_HU country_IL country_LT country_PL country_RU country_TR dcmi_Software iso639_cat iso639_ces iso639_chu iso639_deu iso639_eng iso639_fra iso639_grc iso639_hbo iso639_hun iso639_lit iso639_nno iso639_nob iso639_pol iso639_rus iso639_spa iso639_tur


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-5672
Up-to-date as of: Wed Mar 5 0:42:39 EST 2025