OLAC Record
oai:lindat.mff.cuni.cz:11234/1-5824

Metadata
Title:Verbs annotated for morphemic structure in Czech, English, German, Spanish
Bibliographic Citation:http://hdl.handle.net/11234/1-5824
Creator:Hledíková, Hana
Date (W3CDTF):2025-01-03T11:39:37Z
Date Available:2025-01-03T11:39:37Z
Description:A sample of verb lemmas in four languages: Czech (19,030 lemmas), English (9,965 lemmas), German (27,224 lemmas), Spanish (11,888 lemmas). Each verb lemma is annotated for its morphemic structure (i.e., segmented into the prefiex(es), root(s), suffix(es) and ending(s) that the given lemma contains), classification of its root morph to a root morpheme where needed (to facilitate grouping of verbs with the same root morpheme), and its frequency of the verb in a 100 M corpus. Two versions are available for each language: one with a more coarse-grained segmentation, which captures the morphemic structure that is synchronically available, and a version with a more fine-grained segmentation, which also captures the word's etymology.
Identifier (URI):http://hdl.handle.net/11234/1-5824
Language:Czech
English
German
Spanish
Language (ISO639):ces
eng
deu
spa
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Creative Commons - Attribution 4.0 International (CC BY 4.0)
http://creativecommons.org/licenses/by/4.0/
Subject:morphemes
word-formation
verbs
Czech language
English language
German language
Spanish language
Subject (ISO639):ces
eng
deu
spa
Type:lexicalConceptualResource
Type (DCMI):Text
Type (OLAC):lexicon

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-5824
DateStamp:  2025-01-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Hledíková, Hana. 2025. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ country_DE country_ES country_GB dcmi_Text iso639_ces iso639_deu iso639_eng iso639_spa olac_lexicon

Inferred Metadata

Country: Czech RepublicGermanySpainUnited Kingdom
Area: Europe


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-5824
Up-to-date as of: Wed Mar 5 0:42:45 EST 2025