OLAC Record
oai:lindat.mff.cuni.cz:11234/1-3262

Metadata
Title:DOESTE v0.5
Bibliographic Citation:http://hdl.handle.net/11234/1-3262
Creator:Martins, Mário
Janssen, Maarten
Santos, Taiza
Lopes, Raquel
Souza, Thiago
Date (W3CDTF):2020-10-09T08:57:36Z
Date Available:2020-10-09T08:57:36Z
Description:DOESTE v0.5 is a set of developmental corpora of texts written by Brazilian and Portuguese school-age children and adolescents. It is a work in progress. The texts written by monolingual children and adolescents in European Portuguese were collected between September 2011 and January 2012, from different public schools in Lisbon (Portugal). It is composed of 244 narrative (n=122) and argumentative (n=122) texts. The subjects (51% female and 49% male) are students enroled in the 5th grade (n=52; mean age=10.19), in the 7th grade (n=92; mean age=12.33) and in the 10th grade (n=100; mean age=15.16) from the Portuguese basic schooling. The subcorpus of Portuguese texts is fully tokenized and morphologically annotated, in addition to presenting the sentence occurrences. The texts written by monolingual children and adolescents in Brazilian Portuguese have been collected since 2017, from different public schools in three cities in Rio Grande do Norte (Brazil). It is currently composed of narrative (n=225) and argumentative (n=225) texts. The subjects (53% female and 47% male) are students enroled in the 5th grade (n=68; mean age=11.13), in the 9th grade (n=82; mean age=15.32) and in the 12th grade (n=224; mean age=17.96) from the Brazilian basic schooling. The subcorpora of Brazilian texts is still in the compilation, but a large part is already searchable, being tokenized and morphologically annotated. The Brazilian subcorpus also presents itself with the original transcripts, along original images. Portuguese and Brazilian texts were collected from similar tasks: Narrative-based task: Tell a remarkable story (real or imagined) that you and your best friend lived during the last school vacation. Argumentative based-task: Do you think social networks (Facebook, Twitter, Google+, Windows Live Space, etc.) are important today? Write a text to be published on your school's blog where you express your opinion on social networks. In this text, you must say whether you are for or against the existence of social networks. Don't forget to justify your opinion! The next version of DOESTE intends to present semantic annotations and clause and t-unit segmentation. DOESTE v0.5 is developed and maintained by the Educational Linguistics Research Group (LEd), based at the Federal Rural University of the Semiarid Region (UFERSA). DOESTE v0.5 by Mário Martins et al. is licensed under CC BY-NC-ND 4.0.
Identifier (URI):http://hdl.handle.net/11234/1-3262
Language:Portuguese
Language (ISO639):por
Publisher:Federal Rural University of the Semiarid Region
Rights:Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
http://creativecommons.org/licenses/by-nc-nd/4.0/
Subject:Developmental corpus
Writing development
School-age language development
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-3262
DateStamp:  2021-06-29
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Martins, Mário; Janssen, Maarten; Santos, Taiza; Lopes, Raquel; Souza, Thiago. 2020. Federal Rural University of the Semiarid Region.
Terms: area_Europe country_PT dcmi_Text iso639_por olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-3262
Up-to-date as of: Thu Oct 5 0:41:07 EDT 2023