OLAC Record
oai:lindat.mff.cuni.cz:11234/1-5814

Metadata
Title:PDT-Vallex: Czech Valency lexicon linked to treebanks 4.5 (PDT-Vallex 4.5)
Bibliographic Citation:http://hdl.handle.net/11234/1-5814
Creator:Urešová, Zdeňka
Bémová, Alevtina
Fučíková, Eva
Hajič, Jan
Kolářová, Veronika
Mikulová, Marie
Pajas, Petr
Panevová, Jarmila
Štěpánek, Jan
Date (W3CDTF):2025-01-03T19:00:23Z
Date Available:2025-01-03T19:00:23Z
Description:The valency lexicon PDT-Vallex 4.5 is a part of the PDT-C 2.0 release https://hdl.handle.net/11234/1-5813. It is a slightly modified version of PDT-Vallex 4.0 from 2020 (as a part of PDT-C 1.0 corpus) for full compatibility with PDT-C 2.0 annotation, including a completely reworked reference IDs for the word and frame entries. PDT-Vallex has been built in close connection with the annotation of the Prague Dependency Treebank project (PDT) and its successors (mainly the Prague Czech-English Dependency Treebank project, PCEDT, the spoken language corpus (PDTSC) and corpus of user-generated texts in the project Faust). It contains over 14500 valency frames for almost 8500 verbs which occurred in the PDT, PCEDT, PDTSC and Faust corpora. In addition, there are nouns, adjectives and adverbs, linked from the PDT part only, increasing the total to over 20000 valency frames for almost 13000 words. All the corpora have been published in 2024 as the PDT-C 2.0 corpus with the PDT-Vallex 4.5 dictionary included; this is a copy of the dictionary published as a separate item for those not interested in the corpora themselves. It is available in electronically processable format (XML), and also in more human readable form including corpus examples (see the project and web browser links below, and the links to its main publications elsewhere in this metadata). The main feature of the lexicon is its linking to the annotated corpora - each occurrence of each verb is linked to the appropriate valency frame with additional (generalized) information about its usage and surface morphosyntactic form alternatives.
Identifier (URI):http://hdl.handle.net/11234/1-5814
Language:Czech
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Replaces (URI):http://hdl.handle.net/11234/1-3499
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:verbal valency
valency
annotation
linguistic data
lexicon
lexical semantics
PDT
Czech language
Subject (ISO639):ces
Type:lexicalConceptualResource
Type (DCMI):Text
Type (OLAC):lexicon

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-5814
DateStamp:  2025-01-24
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Urešová, Zdeňka; Bémová, Alevtina; Fučíková, Eva; Hajič, Jan; Kolářová, Veronika; Mikulová, Marie; Pajas, Petr; Panevová, Jarmila; Štěpánek, Jan. 2025. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_lexicon

Inferred Metadata

Country: Czech Republic
Area: Europe


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-5814
Up-to-date as of: Wed Mar 5 0:42:44 EST 2025