OLAC Record
oai:catalogue.elra.info:ELRA-W0082

Metadata
Title:88milSMS. A corpus of authentic text messages in French
Access Rights: Rights available for: nonCommercialUse
Date Available (W3CDTF):2015-02-11
Date Issued (W3CDTF):2015-02-11
Date Modified (W3CDTF):2015-02-11
Description:A pluridisciplinary team of linguists and computer scientists (Rachel Panckhurst, Catherine Détrie, Cédric Lopez, Claudine Moïse, Mathieu Roche, Bertrand Verine (Praxiling, Lirmm, Lidilem, Tetis, Viseo) collected more than 88,000 French authentic text messages in Montpellier (2011), as part of the sud4science LR project (Sud4science Languedoc Roussillon. Mutation des pratiques scripturales en communication électronique médiée (main financial support: MSH-M)). This project is part of a vast international project entitled sms4science, coordinated by the CENTAL at Université catholique de Louvain (UCL) in Belgium. Participants from the general public, who donated their SMS to science, were also able to fill in a sociolinguistic questionnaire. The text messages from the sud4science LR project were then semi-automatically anonymised (in collaboration with student internships and a legal adviser-CIL, Nicolas Hvoinsky, SAJI, Université Paul-Valéry), before being partially transcoded (into standardised French) and annotated (cf. Panckhurst et al. 2013).To obtain the corpus, please visit the following website: http://88milsms.huma-num.fr/
Identifier:ELRA-W0082
ISLRN: 024-713-187-947-8
Identifier (URI):https://catalog.elra.info/en-us/repository/browse/ELRA-W0082/
Language:French
Language (ISO639):fra
Medium:Not specified
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0082
DateStamp:  2015-02-11
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2015. ELRA (European Language Resources Association).
Terms: area_Europe country_FR dcmi_Text iso639_fra olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0082
Up-to-date as of: Fri Apr 19 6:30:14 EDT 2024