OLAC Record
oai:lindat.mff.cuni.cz:11234/1-4613

Metadata
Title: POS Tagging and Lemmatization (Czech model)
Bibliographic Citation: http://hdl.handle.net/11234/1-4613
Creator: Vysušilová, Petra
Straka, Milan
Date (W3CDTF): 2021-11-18T15:58:05Z
Date Available: 2021-11-18T15:58:05Z
Description: Model for Czech POS tagging and lemmatization, built on RobeCzech, a Czech version of the BERT model, and trained on data from the Prague Dependency Treebank 3.5. The model is part of the master's thesis Czech NLP with Contextualized Embeddings and achieved state-of-the-art performance as of the thesis submission date. A demo Jupyter notebook is available on the project GitHub.
Identifier (URI): http://hdl.handle.net/11234/1-4613
Language: Czech
Language (ISO639): ces
Publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject: BERT
PoS tagging
lemmatization
Czech language
Subject (ISO639): ces
Type: languageDescription
Type (DCMI): Text
Type (OLAC): language_description

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-4613
DateStamp:  2021-11-18
GetRecord:  OAI-PMH request for simple DC format
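
The GetRecord entries in this record refer to OAI-PMH requests whose link targets are not reproduced here. As a minimal sketch, the following shows how such a request URL could be assembled from the OaiIdentifier above. The base URL is a hypothetical placeholder (the record does not state the archive's OAI-PMH endpoint), the function name `get_record_url` is illustrative, and the metadata prefixes `olac` and `oai_dc` are assumed to be the ones the archive exposes for the OLAC and simple DC formats.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the record does not give the archive's OAI-PMH endpoint.
BASE_URL = "https://example.org/oai/request"

# Identifier taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:lindat.mff.cuni.cz:11234/1-4613"


def get_record_url(identifier: str, metadata_prefix: str, base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL.

    GetRecord takes three arguments: the verb itself, the item identifier,
    and the metadata format requested (metadataPrefix).
    """
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Assumed prefixes: "olac" for the OLAC format, "oai_dc" for simple Dublin Core.
olac_url = get_record_url(OAI_IDENTIFIER, "olac")
dc_url = get_record_url(OAI_IDENTIFIER, "oai_dc")
```

Swapping `BASE_URL` for the archive's actual endpoint would yield requests comparable to the two GetRecord links listed above; only the `metadataPrefix` argument differs between the OLAC and simple DC variants.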

Search Info

Citation: Vysušilová, Petra; Straka, Milan. 2021. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_language_description

Inferred Metadata

Country: Czech Republic
Area: Europe


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-4613
Up-to-date as of: Thu Oct 5 0:43:08 EDT 2023