OLAC Record
oai:lindat.mff.cuni.cz:11234/1-5413

Metadata
Title:Diakorp v6: diachronic corpus of Czech
Bibliographic Citation:http://hdl.handle.net/11234/1-5413
Creator:Kučera, Karel
Řehořková, Anna
Stluka, Martin
Date (W3CDTF):2024-02-01T21:14:24Z
Date Available:2024-02-01T21:14:24Z
Description:Diachronic corpus of Czech comprising 3.45 million words (4.1 million tokens). It contains 116 texts from the 14th to the 20th century. The texts are transcribed, not transliterated. Diakorp v6 is provided in a CoNLL-U-like vertical format used as input to the Manatee query engine. The data thus correspond to the corpus available to registered users of the CNC via the KonText query interface at http://www.korpus.cz
Identifier (URI):http://hdl.handle.net/11234/1-5413
Language:Czech
Language (ISO639):ces
Publisher:Charles University, Faculty of Arts, Institute of the Czech National Corpus
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:corpus
diachronic
Czech
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-5413
DateStamp:  2024-02-01
GetRecord:  OAI-PMH request for simple DC format
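
The GetRecord entries in this record refer to OAI-PMH requests for a single identifier in a given metadata format. As a minimal sketch of how such a request URL is composed, the Python snippet below builds the query string from the three arguments the OAI-PMH protocol defines for GetRecord (verb, identifier, metadataPrefix). The base URL is a placeholder and an assumption, since this record does not state the repository endpoint; the helper name build_get_record_url is hypothetical, and "olac" is assumed here to be the prefix the repository uses for the OLAC format ("oai_dc" is the standard prefix for simple Dublin Core).

```python
from urllib.parse import urlencode


def build_get_record_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Compose an OAI-PMH GetRecord request URL (hypothetical helper)."""
    # GetRecord takes exactly these three arguments in the OAI-PMH protocol.
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# Placeholder endpoint: an assumption, not taken from this record.
BASE_URL = "https://example.org/oai/request"

# Identifier copied from the OaiIdentifier field of this record.
IDENTIFIER = "oai:lindat.mff.cuni.cz:11234/1-5413"

olac_url = build_get_record_url(BASE_URL, IDENTIFIER, "olac")   # OLAC format (assumed prefix)
dc_url = build_get_record_url(BASE_URL, IDENTIFIER, "oai_dc")   # simple DC format

print(olac_url)
print(dc_url)
```

The identifier is percent-encoded by urlencode, so the colons and the slash in it do not interfere with the query string. Replace the placeholder base URL with the repository's actual OAI-PMH endpoint before sending a request.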

Search Info

Citation: Kučera, Karel; Řehořková, Anna; Stluka, Martin. 2024. Diakorp v6: diachronic corpus of Czech. Charles University, Faculty of Arts, Institute of the Czech National Corpus.
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-5413
Up-to-date as of: Wed Mar 5 0:42:35 EST 2025