OLAC Record
oai:www.ldc.upenn.edu:LDC2022T04

Metadata
Title:Qatari Corpus of Argumentative Writing
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Ahmed, Abdelhamid M., et al. Qatari Corpus of Argumentative Writing LDC2022T04. Web Download. Philadelphia: Linguistic Data Consortium, 2022
Contributor:Ahmed, Abdelhamid M.
Myhill, Debra
Abdollahzadeh, Esmaeel
McCallum, Lee
Zaghouani, Wajdi
Rezk, Lameya
Jrad, Anissa
Zhang, Xiao
Date (W3CDTF):2022
Date Issued (W3CDTF):2022-07-15
Description:*Introduction* Qatari Corpus of Argumentative Writing was developed by Qatar University, University of Exeter and Hamad Bin Khalifa University and is comprised of approximately 200,000 tokens of Arabic and English writing by undergraduate students (159 female, 36 male) along with annotations and related metadata. Students were native Arabic speakers and fluent in English; each student wrote one Arabic and one English essay in response to specific argumentative prompts. They were instructed to include in their essays a clear thesis statement supported by relevant evidence. *Data* The corpus is divided into Arabic and English parts, each of which contains 195 essays. Part-of-speech annotated files are included with the essay text. All text files are in UTF-8 encoded text format. Metadata is comprised of information about the students (gender, major, first language, second language) and information about the essay texts (serial numbers of texts, word limits, genre, date of writing, time spent on writing, place of writing). Metadata is presented in UTF-8 encoded CSV format. *Samples* Please view this text sample (TXT) and annotation sample (TXT). *Updates* None at this time.
Extent:Corpus size: 8301 KB
Identifier:LDC2022T04
https://catalog.ldc.upenn.edu/LDC2022T04
ISBN: 1-58563-992-3
ISLRN: 703-290-141-447-2
DOI: 10.35111/k307-kg62
Language:Arabic
English
Language (ISO639):ara
eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2022T04
Rights Holder:Portions © 2022 Hamad Bin Khalifa University, © 2022 Qatar University, © 2022 University of Exeter, © 2022 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2022T04
DateStamp:  2024-09-27
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Ahmed, Abdelhamid M.; Myhill, Debra; Abdollahzadeh, Esmaeel; McCallum, Lee; Zaghouani, Wajdi; Rezk, Lameya; Jrad, Anissa; Zhang, Xiao. 2022. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Text iso639_ara iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2022T04
Up-to-date as of: Fri Dec 6 7:49:11 EST 2024