OLAC Record
oai:www.ldc.upenn.edu:LDC2024S09

Metadata
Title:Ravnursson Faroese Speech and Transcripts
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Hernández Mena, Carlos Daniel, Annika Simonsen, and Jon Gudnason. Ravnursson Faroese Speech and Transcripts LDC2024S09. Web Download. Philadelphia: Linguistic Data Consortium, 2024
Contributor:Hernández Mena, Carlos Daniel
Simonsen, Annika
Gudnason, Jon
Date (W3CDTF):2024
Date Issued (W3CDTF):2024-08-15
Description:*Introduction* Ravnursson Faroese Speech and Transcripts contains 109 hours of Faroese prompted speech from 433 speakers (249 female, 184 male), corresponding transcripts and speaker metadata. It is an extract from the Basic Language Resource Kit 1.0 (BLARK 1.0) developed by the Faroe Islands' Ravnur Project. *Data* Speech data was collected in 2022. Speakers from all major dialect areas in the Faroe Islands in three age groups -- 15-35, 36-60, and 61+ years -- read texts that included a word list, a phrase list, closed vocabulary readings, and short texts. Recordings also contain spontaneous speech. TASCAM DR-40 Linear PCM audio recorders captured speech data at 48 kHz, downsampled for this corpus. The audio data is divided into train, development, and test sets and is presented as flac compressed, single channel, 16 kHz, 16-bit linear PCM. Recordings were orthographically transcribed and time-stamped. Transcripts and speaker metadata are included in a tab separated file. *Samples* Please view this metadata sample (TSV) and audio sample (FLAC). *Updates* None at this time.
Extent:Corpus size: 6647345 KB
Format:Sampling Rate: 16000
Sampling Format: pcm
Identifier:LDC2024S09
https://catalog.ldc.upenn.edu/LDC2024S09
ISLRN: 558-066-910-837-0
DOI: 10.35111/d60c-5x79
Language:Faroese
Language (ISO639):fao
License:Ravnursson Faroese Speech and Transcripts (For-Profit): https://catalog.ldc.upenn.edu/license/ravnursson-faroese-speech-and-transcripts-for-profit.pdf
Ravnursson Faroese Speech and Transcripts (Non-Member): https://catalog.ldc.upenn.edu/license/ravnursson-faroese-speech-and-transcripts-non-member.pdf
Ravnursson Faroese Speech and Transcripts (Not-For-Profit): https://catalog.ldc.upenn.edu/license/ravnursson-faroese-speech-and-transcripts-not-for-profit.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2024S09
Rights Holder:Portions © 2024 Reykjavik University, © 2024 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2024S09
DateStamp:  2024-08-15
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Hernández Mena, Carlos Daniel; Simonsen, Annika; Gudnason, Jon. 2024. Linguistic Data Consortium.
Terms: dcmi_Sound dcmi_Text iso639_fao olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2024S09
Up-to-date as of: Fri Dec 6 7:49:17 EST 2024