OLAC Record
oai:www.ldc.upenn.edu:LDC2022L01

Metadata
Title:Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Lau, Mingfei, et al. Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon LDC2022L01. Web Download. Philadelphia: Linguistic Data Consortium, 2022.
Contributor:Lau, Mingfei
Zhong, Muhan
Lau, Chaak-ming
Su, Jian
Chan, Henry
Cheung, Bing
Date (W3CDTF):2022
Date Issued (W3CDTF):2022-10-17
Description:*Introduction* Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon was developed by the Cantonese Computational Linguistics Infrastructure Working Group. It contains approximately 130,000 Cantonese character, word, and phrase entries, each paired with its romanized pronunciation in Jyutping, a romanization scheme created by the Linguistic Society of Hong Kong. *Data* Data was collected from a variety of physical and online sources. The character collection was normalized to reconcile differences between traditional and simplified Chinese, regional and other character variants, and differences in orthography. Additional information about this process and the lexicon in general is available in the documentation included with this release. The corpus data is presented as a collection of UTF-8 encoded CSV files. *Samples* Please view this word sample. *Updates* None at this time.
Extent:Corpus size: 1915 KB
Identifier:LDC2022L01
https://catalog.ldc.upenn.edu/LDC2022L01
ISBN: 1-58563-998-2
ISLRN: 401-658-348-056-8
DOI: 10.35111/8gvn-bj05
Language:Yue Chinese
Language (ISO639):yue
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2022L01
Rights Holder:Portions © 2022 Cantonese Computational Linguistics Infrastructure Working Group, © 2022 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):lexicon

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2022L01
DateStamp:  2023-01-01
GetRecord:  OAI-PMH request for simple DC format
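
The GetRecord entries in this record correspond to OAI-PMH requests. A minimal sketch of how such a request URL could be assembled is given here, assuming the standard OAI-PMH query parameters (verb, identifier, metadataPrefix) and the prefixes "olac" and "oai_dc" for the OLAC and simple DC formats. The base URL is a hypothetical placeholder, since this record does not state the endpoint, and the helper name get_record_url is illustrative only.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the record does not state the OAI-PMH endpoint.
BASE_URL = "https://example.org/oai"

# Identifier taken from the OaiIdentifier field of this record.
OAI_ID = "oai:www.ldc.upenn.edu:LDC2022L01"


def get_record_url(identifier: str, metadata_prefix: str,
                   base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL (illustrative helper)."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Assumed prefixes: "olac" for OLAC format, "oai_dc" for simple DC format.
olac_url = get_record_url(OAI_ID, "olac")
dc_url = get_record_url(OAI_ID, "oai_dc")

print(olac_url)
print(dc_url)
```

Replace BASE_URL with the archive's actual OAI-PMH endpoint before use; the colons in the identifier are percent-encoded by urlencode.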

Search Info

Citation: Lau, Mingfei; Zhong, Muhan; Lau, Chaak-ming; Su, Jian; Chan, Henry; Cheung, Bing. 2022. Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon. Linguistic Data Consortium.
Terms: area_Asia country_CN dcmi_Text iso639_yue olac_lexicon


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2022L01
Up-to-date as of: Thu Oct 24 7:31:26 EDT 2024