OLAC Record oai:catalogue.elra.info:ELRA-S0379 |
Metadata | ||
Title: | JV_TDM Corpus | |
Access Rights: | Rights available for: attribution | |
Date Available (W3CDTF): | 2016-01-05 | |
Date Issued (W3CDTF): | 2016-01-05 | |
Date Modified (W3CDTF): | 2016-01-19 | |
Description: | The JV_TDM corpus provides a phonetic annotation of 37 chapters of the original French version of “Around the World in 80 Days” by Jules Verne, read by a single speaker. Each chapter has been annotated in a separate .TextGrid file. The audio files are not included in this release. They are available under a CC BY-NC-SA licence on the site www.litteratureaudio.com (www.litteratureaudio.com/livre-audio-gratuit-mp3/jules-verne-le-tour-du-monde-en-80-jours.html). The total audio size is 6h 41mn 36s with 5h 2mn 41s of speech. In the JV_TDM corpus, the speaker uttered 78,876 words at an average speed of 5.82 syllables and 13.49 phones per second. The speaker produced 244,908 phones and 11,352 pauses (short and long). All phonemes except glottal stops and palatal/velar nasals are encountered more than 1000 times. The .TextGrid files contain several annotation tiers: phoneme, number of alphanumeric characters corresponding to a phone, syllable, transcription, PoS, paragraph break, sentence break, prosodic annotations, breathing pauses. With the text-to-speech system COMPOST, the original text material was first PoS-annotated, phonetically transcribed and syllabified, and plausible pauses were inserted. Text-to-speech alignment was then performed on paragraphs, which were manually delimited with Praat. The segmentation and all the annotations were manually validated. Reference: Bailly, G. & C. Gouvernayre (2012). Pauses and respiratory markers of the structure of book reading. Interspeech. Portland, OR, pp. 2218-2221. | |
Identifier: | ELRA-S0379 | |
ISLRN: | 371-240-320-910-4 | |
Identifier (URI): | https://catalog.elra.info/en-us/repository/browse/ELRA-S0379/ | |
Language: | French | |
Language (ISO639): | fra | |
Medium: | downloadable | |
Publisher: | ELRA (European Language Resources Association) | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
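The durations and counts quoted in the Description field can be cross-checked with a few lines of arithmetic. The sketch below is only a sanity check, not part of the corpus tooling: the variable names are illustrative, and it assumes the reported phone rate is computed over speech time rather than total audio time, which is an inference from the numbers and not something the record states.

```python
# Figures copied from the Description field of this record.
PHONES = 244_908
WORDS = 78_876
PAUSES = 11_352


def to_seconds(hours: int, minutes: int, seconds: int) -> int:
    """Convert an h/min/s duration to a number of seconds."""
    return hours * 3600 + minutes * 60 + seconds


total_seconds = to_seconds(6, 41, 36)   # "6h 41mn 36s" of audio
speech_seconds = to_seconds(5, 2, 41)   # "5h 2mn 41s" of speech

# Assumption: the rate is taken over speech time only (pauses excluded).
phones_per_second = PHONES / speech_seconds
speech_share = speech_seconds / total_seconds
words_per_minute = WORDS / (speech_seconds / 60)

print(f"phones per second: {phones_per_second:.2f}")
print(f"share of audio that is speech: {speech_share:.1%}")
print(f"words per minute of speech: {words_per_minute:.0f}")
```

Under that assumption the computed phone rate rounds to the 13.49 phones per second given in the Description, which suggests the reported rates exclude pause time.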
OLAC Info |
||
Archive: | ELRA Catalogue of Language Resources | |
Description: | http://www.language-archives.org/archive/catalogue.elra.info | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:catalogue.elra.info:ELRA-S0379 | |
DateStamp: | 2016-01-05 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | n.a. 2016. ELRA (European Language Resources Association). | |
Terms: | area_Europe country_FR dcmi_Sound iso639_fra olac_primary_text |
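The GetRecord entries listed under OLAC Info and OAI Info correspond to OAI-PMH requests for this record's identifier. As a minimal sketch, such a request URL can be built from the `verb`, `identifier` and `metadataPrefix` query parameters defined by OAI-PMH. The base URL below is a hypothetical placeholder, since this record does not give the endpoint, and the function name is illustrative; `oai_dc` is the standard prefix for simple Dublin Core, while `olac` is assumed here to be the prefix for the OLAC format.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: replace with the real OAI-PMH base URL of the archive.
BASE_URL = "https://example.org/oai"

# Taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:catalogue.elra.info:ELRA-S0379"


def get_record_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


dc_url = get_record_url(BASE_URL, OAI_IDENTIFIER, "oai_dc")   # simple DC format
olac_url = get_record_url(BASE_URL, OAI_IDENTIFIER, "olac")   # OLAC format (assumed prefix)

print(dc_url)
print(olac_url)
```

The sketch only constructs the URLs and does not send a request, so it can be run without network access; the response to a real GetRecord request would be an XML document containing the metadata shown in this record.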