OLAC Record
oai:lindat.mff.cuni.cz:11234/1-3367

Metadata
Title:Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)
Bibliographic Citation:http://hdl.handle.net/11234/1-3367
Creator:Ramisch, Carlos
Guillaume, Bruno
Savary, Agata
Waszczuk, Jakub
Candito, Marie
Vaidya, Ashwini
Barbu Mititelu, Verginica
Bhatia, Archna
Iñurrieta, Uxoa
Giouli, Voula
Güngör, Tunga
Jiang, Menghan
Lichte, Timm
Liebeskind, Chaya
Monti, Johanna
Ramisch, Renata
Stymme, Sara
Walsh, Abigail
Xu, Hongzhi
Palka-Binkiewicz, Emilia
Ehren, Rafael
Stymne, Sara
Constant, Matthieu
Pasquer, Caroline
Parmentier, Yannick
Antoine, Jean-Yves
Carlino, Carola
Caruso, Valeria
Di Buono, Maria Pia
Pascucci, Antonio
Raffone, Annalisa
Riccio, Anna
Sangati, Federico
Speranza, Giulia
Cordeiro, Silvio Ricardo
de Medeiros Caseli, Helena
Miranda, Isaac
Rademaker, Alexandre
Vale, Oto
Villavicencio, Aline
Wick Pedro, Gabriela
Wilkens, Rodrigo
Zilio, Leonardo
Rizea, Monica-Mihaela
Ionescu, Mihaela
Onofrei, Mihaela
Chen, Jia
Ge, Xiaomin
Hu, Fangyuan
Hu, Sha
Li, Minli
Liu, Siyuan
Qin, Zhenzhen
Sun, Ruilong
Wang, Chenweng
Xiao, Huangyang
Yan, Peiyi
Yih, Tsy
Yu, Ke
Yu, Songping
Zeng, Si
Zhang, Yongchen
Zhao, Yun
Foufi, Vassiliki
Fotopoulou, Aggeliki
Markantonatou, Stella
Papadelli, Stella
Louizou, Sevasti
Aduriz, Itziar
Estarrona, Ainara
Gonzalez, Itziar
Gurrutxaga, Antton
Uria, Larraitz
Urizar, Ruben
Foster, Jennifer
Lynn, Teresa
Elyovitch, Hevi
Ha-Cohen Kerner, Yaakov
Malka, Ruth
Jain, Kanishka
Puri, Vandana
Ratori, Shraddha
Shukla, Vishakha
Srivastava, Shubham
Berk, Gozde
Erden, Berna
Yirmibeşoğlu, Zeynep
Date (W3CDTF):2020-10-08T11:08:16Z
Date Available:2020-10-08T11:08:16Z
Description:This multilingual resource contains corpora in which verbal MWEs have been manually annotated, gathered at the occasion of the 1.2 edition of the PARSEME Shared Task on semi-supervised Identification of Verbal MWEs (2020). VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). For the 1.2 shared task edition, the data covers 14 languages, for which VMWEs were annotated according to the universal guidelines. The corpora are provided in the cupt format, inspired by the CONLL-U format. Morphological and syntactic information ­­­­– not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.2 (2020). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.2
Identifier (URI):http://hdl.handle.net/11234/1-3367
Is Replaced By (URI):http://hdl.handle.net/11372/LRT-5124
Language:German
Modern Greek (1453-)
Basque
French
Irish
Hebrew
Hindi
Italian
Polish
Portuguese
Romanian
Swedish
Turkish
Chinese
Language (ISO639):deu
ell
eus
fra
gle
heb
hin
ita
pol
por
ron
swe
tur
zho
Publisher:PARSEME
Replaces (URI):http://hdl.handle.net/11372/LRT-2842
Rights:PARSEME Shared Task Data (v. 1.2) Agreement
https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.2
Subject:multiword expressions
verbal multiword expressions
light verb construction
verb-particle constructions
inherently reflexive verbs
verbal idioms
multi-verb constructions
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-3367
DateStamp:  2023-05-10
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Ramisch, Carlos; Guillaume, Bruno; Savary, Agata; Waszczuk, Jakub; Candito, Marie; Vaidya, Ashwini; Barbu Mititelu, Verginica; Bhatia, Archna; Iñurrieta, Uxoa; Giouli, Voula; Güngör, Tunga; Jiang, Menghan; Lichte, Timm; Liebeskind, Chaya; Monti, Johanna; Ramisch, Renata; Stymme, Sara; Walsh, Abigail; Xu, Hongzhi; Palka-Binkiewicz, Emilia; Ehren, Rafael; Stymne, Sara; Constant, Matthieu; Pasquer, Caroline; Parmentier, Yannick; Antoine, Jean-Yves; Carlino, Carola; Caruso, Valeria; Di Buono, Maria Pia; Pascucci, Antonio; Raffone, Annalisa; Riccio, Anna; Sangati, Federico; Speranza, Giulia; Cordeiro, Silvio Ricardo; de Medeiros Caseli, Helena; Miranda, Isaac; Rademaker, Alexandre; Vale, Oto; Villavicencio, Aline; Wick Pedro, Gabriela; Wilkens, Rodrigo; Zilio, Leonardo; Rizea, Monica-Mihaela; Ionescu, Mihaela; Onofrei, Mihaela; Chen, Jia; Ge, Xiaomin; Hu, Fangyuan; Hu, Sha; Li, Minli; Liu, Siyuan; Qin, Zhenzhen; Sun, Ruilong; Wang, Chenweng; Xiao, Huangyang; Yan, Peiyi; Yih, Tsy; Yu, Ke; Yu, Songping; Zeng, Si; Zhang, Yongchen; Zhao, Yun; Foufi, Vassiliki; Fotopoulou, Aggeliki; Markantonatou, Stella; Papadelli, Stella; Louizou, Sevasti; Aduriz, Itziar; Estarrona, Ainara; Gonzalez, Itziar; Gurrutxaga, Antton; Uria, Larraitz; Urizar, Ruben; Foster, Jennifer; Lynn, Teresa; Elyovitch, Hevi; Ha-Cohen Kerner, Yaakov; Malka, Ruth; Jain, Kanishka; Puri, Vandana; Ratori, Shraddha; Shukla, Vishakha; Srivastava, Shubham; Berk, Gozde; Erden, Berna; Yirmibeşoğlu, Zeynep. 2020. PARSEME.
Terms: area_Asia area_Europe country_DE country_ES country_FR country_GR country_IE country_IL country_IN country_IT country_PL country_PT country_RO country_SE country_TR dcmi_Text iso639_deu iso639_ell iso639_eus iso639_fra iso639_gle iso639_heb iso639_hin iso639_ita iso639_pol iso639_por iso639_ron iso639_swe iso639_tur iso639_zho olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-3367
Up-to-date as of: Thu Oct 5 0:41:08 EDT 2023