OLAC Record oai:catalogue.elra.info:ELRA-S0412 |
Metadata | ||
Title: | Japanese Kids Speech database (Upper Grade) | |
Access Rights: | Rights available for: nonCommercialUse, commercialUse | |
Date Available (W3CDTF): | 2020-10-08 | |
Date Issued (W3CDTF): | 2020-10-08 | |
Description: | The Japanese Kids Speech database (Upper Grade) contains recordings of 232 Japanese child speakers (104 male, 128 female) aged 9 to 13 years (fourth, fifth and sixth graders in elementary school), recorded in quiet rooms using smartphones. This database may be combined with the Japanese Kids Speech database (Lower Grade), also available in the ELRA Catalogue under reference ELRA-S0411.

Number of speakers, utterances, duration and age are as follows:
Number of speakers: 232 (104 male, 128 female)
Number of utterances (average): 385 utterances per speaker
Total number of utterances: 89,454
Age: from 9 to 13 years old
Total hours of data: 145.4

1018 sentences were used. Recordings were made with smartphones, and the audio data is stored in .wav files as 16 kHz, mono, 16-bit linear PCM.

Database:
・Audio data: WAV format, 16 kHz, 16-bit, mono (recorded with smartphone)
・Recording scripts: TSV format (tab-delimited), UTF-8 (without BOM)
・Transcription data: TSV format (tab-delimited), UTF-8 (without BOM)
・Size: 16.2 GB

Number of speakers per age:
9 years old: 56 (21 male, 35 female)
10 years old: 71 (30 male, 41 female)
11 years old: 65 (28 male, 37 female)
12 years old: 38 (24 male, 14 female)
13 years old: 2 (1 male, 1 female)

Structure of database:
├─ readme.txt
├─ Japanese Kids Speech Database.pdf    Description document of the database
├─ Transcription.tsv    Transcription
├─ scripts.tsv    Script
│
└─ voices/    directory of audio data
    ├─ high/    directory of upper grade
        └─ (speaker_ID/)    directory of speaker ID (six digits)
            └─ (audio_file)    audio file (WAV format, 16 kHz, 16-bit, mono)

File naming conventions of audio files are as follows:
Field number | Contents | Description | Remarks
0 | Language ID | “JA” (fixed) | Japanese
1 | Speaker ID | Six digits | 5XXXXX
2 | Script ID | HXXXX | XXXX: four digits
3 | Age | Two digits |
4 | Gender | M: male, F: female |

The field separator character is “_”. For example, if the audio file name is “JA_500002_H0001_10_F.wav”, the file has the following meaning:
JA: language ID (Japanese)
500002: speaker ID
H0001: script ID
10: age (ten years old)
F: gender (female) | |
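The file naming convention described above can be checked mechanically. The following is a minimal Python sketch, not part of the database distribution: the names `AudioFileInfo` and `parse_audio_filename` are hypothetical, and the pattern assumes that speaker IDs always start with 5 (per the "5XXXXX" remark) and that ages below 10 are zero-padded to two digits.

```python
import re
from typing import NamedTuple

# Pattern derived from the naming convention in the description:
#   JA_<speaker ID>_<script ID>_<age>_<gender>.wav
# Assumptions: speaker IDs start with "5" (remark "5XXXXX") and
# ages below 10 are zero-padded (e.g. "09").
FILENAME_RE = re.compile(
    r"^(?P<language>JA)"
    r"_(?P<speaker_id>5\d{5})"
    r"_(?P<script_id>H\d{4})"
    r"_(?P<age>\d{2})"
    r"_(?P<gender>[MF])"
    r"\.wav$"
)


class AudioFileInfo(NamedTuple):
    """Fields encoded in an audio file name (hypothetical helper type)."""
    language: str    # field 0: "JA" (fixed)
    speaker_id: str  # field 1: six digits
    script_id: str   # field 2: "H" followed by four digits
    age: int         # field 3: age in years
    gender: str      # field 4: "M" or "F"


def parse_audio_filename(name: str) -> AudioFileInfo:
    """Split a file name such as JA_500002_H0001_10_F.wav into its fields."""
    match = FILENAME_RE.match(name)
    if match is None:
        raise ValueError(f"file name does not follow the convention: {name!r}")
    return AudioFileInfo(
        language=match["language"],
        speaker_id=match["speaker_id"],
        script_id=match["script_id"],
        age=int(match["age"]),
        gender=match["gender"],
    )


if __name__ == "__main__":
    # The example file name given in the description.
    print(parse_audio_filename("JA_500002_H0001_10_F.wav"))
```

Applied to the example name from the description, the sketch yields speaker ID 500002, script ID H0001, age 10 and gender F. A strict pattern is used so that files that do not follow the convention raise an error instead of being silently mis-parsed.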
Identifier: | ELRA-S0412 | |
ISLRN: 846-295-092-462-7 | ||
Identifier (URI): | https://catalog.elra.info/en-us/repository/browse/ELRA-S0412/ | |
Language: | Japanese | |
Language (ISO639): | jpn | |
Medium: | Not specified | |
Publisher: | ELRA (European Language Resources Association) | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
OLAC Info | ||
Archive: | ELRA Catalogue of Language Resources | |
Description: | http://www.language-archives.org/archive/catalogue.elra.info | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info | ||
OaiIdentifier: | oai:catalogue.elra.info:ELRA-S0412 | |
DateStamp: | 2020-10-08 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | n.a. 2020. ELRA (European Language Resources Association). | |
Terms: | area_Asia country_JP dcmi_Sound iso639_jpn olac_primary_text |