OLAC Record oai:catalogue.elra.info:ELRA-S0415 |
Metadata | ||
Title: | Persian Speech Corpus | |
Access Rights: | Rights available for: nonCommercialUse, commercialUse | |
Date Available (W3CDTF): | 2022-09-27 | |
Date Issued (W3CDTF): | 2022-09-27 | |
Description: | This dataset contains more than 31 hours and 30 minutes of Persian scripted monologue and dialogue data, recorded from 89 Persian speakers (39 males and 50 females) aged 17 to 80 in Iran (Tehrani dialect). Recordings were made between April and January 2022. Data consists of read and spontaneous speech recordings: books read aloud, recorded podcasts, newspaper articles, radio conversations, and phone dialogues. Domains are labelled and include: Accounting (ACC), Banking (BAN), Economics (ECO), Finance (FIN), Insurance (INS), Literature (LIT), Marketing (MBA), Medicine (MED), Psychology (PSY), Science (SCI), Technology (TEK), Telecommunication (TEL), and Law (LAW). The total number of words is 242,757. The package consists of 12,232 recording files. Metadata files, including transcriptions, are provided in TSV format and audio files are provided in MP3 format. An Access database containing all written data of the corpus is also provided. All transcriptions were produced manually by native speakers of Persian. | |
Identifier: | ELRA-S0415 | |
ISLRN: | 058-406-130-314-1 | |
Identifier (URI): | https://catalog.elra.info/en-us/repository/browse/ELRA-S0415/ | |
Language: | Persian | |
Language (ISO639): | fas | |
Medium: | Not specified | |
Publisher: | ELRA (European Language Resources Association) | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | ELRA Catalogue of Language Resources | |
Description: | http://www.language-archives.org/archive/catalogue.elra.info | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:catalogue.elra.info:ELRA-S0415 | |
DateStamp: | 2022-09-27 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | n.a. 2022. ELRA (European Language Resources Association). | |
Terms: | dcmi_Sound iso639_fas olac_primary_text |