OLAC Record
oai:lindat.mff.cuni.cz:11234/1-5812

Metadata
Title:KUKY1.0
Bibliographic Citation:http://hdl.handle.net/11234/1-5812
Creator:Cinková, Silvie
Kuk, Michal
Šamánková, Jana
Kubíková, Barbora
Pospíšil, Přemysl
Mírovský, Jiří
Hladká, Barbora
Novotná, Tereza
Date (W3CDTF):2025-01-27T17:44:52Z
Date Available:2025-01-27T17:44:52Z
Description:KUKY is a curated selection of 224 Czech administrative and legal documents for readability research, stored in two JSON files. The documents come partly from public databases (Office of the Ombudsman, courts) and from private sources (letters, public local administration announcements). Some documents come in documented draft-revision pairs. They are manually enriched with a two-level annotation: "Relevance Stoplight" and "Speech Acts". This annotation mimics the way a plain-language expert scrutinizes a document before redesigning it for better readability: first, they closely read the entire document and detect problematic passages ("Relevance Stoplight"), classifying them as either incomprehensible or superfluous, or approving them as relevant. In a second step, the editor works with the relevant text according to a genre-specific template ("Speech Acts"). At the metadata level, the documents are graded with respect to their readability, as perceived by experienced plain legal writing teachers.
Identifier (URI):http://hdl.handle.net/11234/1-5812
Language:Czech
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:readability
legal texts
paraphrases
text coherence
ArgMining
annotation
speech act
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-5812
DateStamp:  2025-01-27
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Cinková, Silvie; Kuk, Michal; Šamánková, Jana; Kubíková, Barbora; Pospíšil, Přemysl; Mírovský, Jiří; Hladká, Barbora; Novotná, Tereza. 2025. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-5812
Up-to-date as of: Wed Mar 5 0:42:44 EST 2025