OLAC Record oai:www.ldc.upenn.edu:LDC2022T04 |
Metadata | ||
Title: | Qatari Corpus of Argumentative Writing | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Ahmed, Abdelhamid M., et al. Qatari Corpus of Argumentative Writing LDC2022T04. Web Download. Philadelphia: Linguistic Data Consortium, 2022 | |
Contributor: | Ahmed, Abdelhamid M. | |
Myhill, Debra | ||
Abdollahzadeh, Esmaeel | ||
McCallum, Lee | ||
Zaghouani, Wajdi | ||
Rezk, Lameya | ||
Jrad, Anissa | ||
Zhang, Xiao | ||
Date (W3CDTF): | 2022 | |
Date Issued (W3CDTF): | 2022-07-15 | |
Description: | *Introduction* Qatari Corpus of Argumentative Writing was developed by Qatar University, University of Exeter and Hamad Bin Khalifa University and is comprised of approximately 200,000 tokens of Arabic and English writing by undergraduate students (159 female, 36 male) along with annotations and related metadata. Students were native Arabic speakers and fluent in English; each student wrote one Arabic and one English essay in response to specific argumentative prompts. They were instructed to include in their essays a clear thesis statement supported by relevant evidence. *Data* The corpus is divided into Arabic and English parts, each of which contains 195 essays. Part-of-speech annotated files are included with the essay text. All text files are in UTF-8 encoded text format. Metadata is comprised of information about the students (gender, major, first language, second language) and information about the essay texts (serial numbers of texts, word limits, genre, date of writing, time spent on writing, place of writing). Metadata is presented in UTF-8 encoded CSV format. *Samples* Please view this text sample (TXT) and annotation sample (TXT). *Updates* None at this time. | |
Extent: | Corpus size: 8301 KB | |
Identifier: | LDC2022T04 | |
https://catalog.ldc.upenn.edu/LDC2022T04 | ||
ISBN: 1-58563-992-3 | ||
ISLRN: 703-290-141-447-2 | ||
DOI: 10.35111/k307-kg62 | ||
Language: | Arabic | |
English | ||
Language (ISO639): | ara | |
eng | ||
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2022T04 | |
Rights Holder: | Portions © 2022 Hamad Bin Khalifa University, © 2022 Qatar University, © 2022 University of Exeter, © 2022 Trustees of the University of Pennsylvania | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2022T04 | |
DateStamp: | 2024-09-27 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Ahmed, Abdelhamid M.; Myhill, Debra; Abdollahzadeh, Esmaeel; McCallum, Lee; Zaghouani, Wajdi; Rezk, Lameya; Jrad, Anissa; Zhang, Xiao. 2022. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Text iso639_ara iso639_eng olac_primary_text |