![]() |
OLAC Record oai:www.ldc.upenn.edu:LDC2020T12 |
| Metadata | ||
| Title: | SemTransCNC | |
| Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
| Bibliographic Citation: | Wang, Shichang, et al. SemTransCNC LDC2020T12. Web Download. Philadelphia: Linguistic Data Consortium, 2020 | |
| Contributor: | Wang, Shichang | |
| Huang, Chu-Ren | ||
| Yao, Yao | ||
| Chan, Angel | ||
| Date (W3CDTF): | 2020 | |
| Date Issued (W3CDTF): | 2020-06-22 | |
| Description: | *Introduction* SemTransCNC was developed by The Hong Kong Polytechnic University. It is comprised of a semantic transparency dataset of Chinese nominal compounds built using a series of crowd-based experiments. Nominal compounds were selected from the Sinica Corpus and a modern Chinese lexicon. Crowd workers answered questionnaires that included demographic information and questions about the Chinese language. For assessing overall semantic transparency (OST) of selected compounds, they answered the question: "How is the sum of the meanings of A and B similar to the meaning of AB?" For assessing constituent semantic transparency (CST), they were asked to describe the similarity of A alone to its meaning in AB and the meaning of B alone to its meaning in AB. *Data* SemTransCNC consists of OST and CST data for 1,176 dimorphemic Chinese nominal compounds, which consist of free morphemes and have mid-range frequencies. The text data is presented as a UTF-8 encoded comma separated text file. *Samples* Please view this text sample (CSV). *Updates* None at this time. | |
| Extent: | Corpus size: 140 KB | |
| Identifier: | LDC2020T12 | |
| https://catalog.ldc.upenn.edu/LDC2020T12 | ||
| ISBN: 1-58563-931-1 | ||
| ISLRN: 835-247-023-332-5 | ||
| DOI: 10.35111/vreb-7n07 | ||
| Language: | Mandarin Chinese | |
| Language (ISO639): | cmn | |
| License: | SemTransCNC Agreement: https://catalog.ldc.upenn.edu/license/semtranscnc-agreement.pdf | |
| Medium: | Distribution: Web Download | |
| Publisher: | Linguistic Data Consortium | |
| Publisher (URI): | https://www.ldc.upenn.edu | |
| Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2020T12 | |
| Rights Holder: | Portions © 2020 The Hong Kong Polytechnic University, © 2020 Trustees of the University of Pennsylvania | |
| Type (DCMI): | Text | |
| Type (OLAC): | primary_text | |
OLAC Info |
||
| Archive: | The LDC Corpus Catalog | |
| Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
| GetRecord: | OAI-PMH request for OLAC format | |
| GetRecord: | Pre-generated XML file | |
OAI Info |
||
| OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2020T12 | |
| DateStamp: | 2021-01-01 | |
| GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
| Citation: | Wang, Shichang; Huang, Chu-Ren; Yao, Yao; Chan, Angel. 2020. Linguistic Data Consortium. | |
| Terms: | area_Asia country_CN dcmi_Text iso639_cmn olac_primary_text | |