OLAC Record oai:catalogue.elra.info:ELRA-S0275 |
Metadata | ||
Title: | Slovenian BNSI Broadcast News Speech Corpus | |
Access Rights: | Rights available for: nonCommercialUse, commercialUse | |
Date Available (W3CDTF): | 2008-04-22 | |
Date Issued (W3CDTF): | 2008-04-22 | |
Date Modified (W3CDTF): | 2008-04-22 | |
Description: | This speech database consists of TV news shows (both evening news, “TV Dnevnik” and late night news, “Odmevi”), from the archive of a Slovenian national broadcaster RTV Slovenia. The recordings took place between June 1999 and May 2003. The database comprises a total of 36 hours of recordings (training set: 30 hours, development set: 3 hours and test set: 3 hours), transcribed and manually checked using the Transcriber tool. Transcription conventions are based on documents defined by LDC, LIMSI and COST 278 BN SIG. There are 268,000 words in transcriptions, out of which 37,000 are distinct words. The transcription files contain: orthographic transcriptions, information on acoustic conditions and background, segmentation on turn and section level. The topic is described and marked (25 topic categories) for each section of news show. Speaker information consists of gender, speaking style, accent and origin. 1,565 speakers were recorded (1,069 males, 477 females, 19 unspecified). The speech signal is as follows: 16kHz, 16 bit, WAV, 1 channel. | |
Identifier: | ELRA-S0275 | |
ISLRN: 502-280-144-938-4 | ||
Identifier (URI): | https://catalog.elra.info/en-us/repository/browse/ELRA-S0275/ | |
Language: | Slovenian | |
Language (ISO639): | slv | |
Medium: | Not specified | |
Publisher: | ELRA (European Language Resources Association) | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | ELRA Catalogue of Language Resources | |
Description: | http://www.language-archives.org/archive/catalogue.elra.info | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:catalogue.elra.info:ELRA-S0275 | |
DateStamp: | 2008-04-22 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | n.a. 2008. ELRA (European Language Resources Association). | |
Terms: | area_Europe country_SI dcmi_Sound iso639_slv olac_primary_text |