OLAC Record
oai:catalogue.elra.info:ELRA-S0496

Metadata
Title: Chinese Kids Speech database (Lower Grade)
Access Rights: Rights available for: nonCommercialUse, commercialUse
Date Available (W3CDTF): 2025-07-18
Date Issued (W3CDTF): 2025-07-18
Description: The Chinese Kids Speech database (Lower Grade) contains recordings of 184 Chinese child speakers (98 male and 86 female), aged 6 to 10, recorded in quiet rooms using smartphones. This database may be combined with the Chinese Kids Speech database (Upper Grade), also available in the ELRA Catalogue under reference ELRA-S0497.

Number of speakers, utterances, duration and age are as follows:
  Number of speakers (Male/Female): 184 (98/86)
  Number of utterances (average): 237 utt/spkr
  Total number of utterances: 43,667
  Age: from 6 to 10
  Total hours of data: 87

1,426 sentences were used. Recordings were made with smartphones, and the audio data is stored in .wav files as 16 kHz, mono, 16-bit linear PCM.

Database
・Audio data: WAV format, 16 kHz, 16-bit, mono (recorded with smartphone)
・Transcription data: TSV format (tab-delimited), UTF-8 (without BOM), line ending: LF
・Size: 9.4 GB

Speakers by age and gender:
  Age   Male   Female   Total
  6     11     6        17
  7     11     8        19
  8     18     29       47
  9     47     36       83
  10    11     7        18

Structure of database:
├─ readme.txt
├─ Chinese Kids Speech Database (Lower grade).pdf   Description document of the database
├─ transcription(Lower).tsv                         Transcription
└─ Low/                                             directory of audio data
    └─ (1st/2nd/3rd)                                directory of version ID
        └─ (0/1)                                    directory of gender (0: male, 1: female)
            └─ (audio_file)                         audio file (WAV format, 16 kHz, 16-bit, mono)

Field information of “transcription(Lower).tsv” is as follows:
  Field number   Contents
  0              Script ID
  1              Speaker ID
  2              Audio file name
  3              Transcription (in Chinese)

File naming conventions of audio files are as follows:
  Field number   Contents                Description                                  Remarks
  0              Script ID               Four digits                                  XXXX: four digits
  1              Speaker ID              Three digits                                 XXX: three digits
  2              Age                     Two digits                                   From 06 to 10
  3              Gender                  0: male, 1: female
  4              Utterance No.           Three digits                                 Sequential numbering starting from 001 within each speaker
  5              Recording date          YYYYMMDDHHMM
  6              Recording device name   Recording device name                        Ex. NTH-AN00
  7              OS                      Operating System info of recording device    Ex. android-11
  8              Duration                Duration in msec                             Duration of the actual spoken utterance

The field separator character is “_”. For example, if the audio file name is “1318_373_09_1_010_202205041857_NTH-AN00_android-11_5480.wav”, this file has the following meaning:
  1318: script ID
  373: speaker ID
  09: age (nine years old)
  1: gender (female)
  010: utterance number
  202205041857: recording date (May 4, 2022, at 6:57 PM)
  NTH-AN00: recording device name
  android-11: operating system info of recording device
  5480: duration of the actual spoken utterance (5,480 msec)
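
The file naming convention described above can be decoded mechanically. The following is a minimal Python sketch, not part of the database distribution: the names `AudioName` and `parse_audio_name` are hypothetical, and it assumes that every file name has exactly nine “_”-separated fields and that device and OS names never contain “_” themselves.

```python
from dataclasses import dataclass
from datetime import datetime
from pathlib import PurePath


@dataclass(frozen=True)
class AudioName:
    """Fields 0-8 of an audio file name (hypothetical helper type)."""
    script_id: str        # four digits, kept as text to preserve leading zeros
    speaker_id: str       # three digits, kept as text
    age: int              # 6 to 10
    gender: str           # "male" (0) or "female" (1)
    utterance_no: int     # numbering starts at 001 within each speaker
    recorded_at: datetime
    device: str
    os_info: str
    duration_ms: int      # duration of the actual spoken utterance


def parse_audio_name(file_name: str) -> AudioName:
    """Split a file name on "_" and convert each field.

    Assumption: exactly nine fields; anything else raises ValueError.
    """
    stem = PurePath(file_name.strip()).stem  # drop directory and ".wav"
    parts = stem.split("_")
    if len(parts) != 9:
        raise ValueError(f"expected 9 fields, got {len(parts)}: {file_name!r}")
    script_id, speaker_id, age, gender, utt, date, device, os_info, dur = parts
    return AudioName(
        script_id=script_id,
        speaker_id=speaker_id,
        age=int(age),
        gender={"0": "male", "1": "female"}[gender],
        utterance_no=int(utt),
        recorded_at=datetime.strptime(date, "%Y%m%d%H%M"),
        device=device,
        os_info=os_info,
        duration_ms=int(dur),
    )


# The example file name from the description
info = parse_audio_name(
    "1318_373_09_1_010_202205041857_NTH-AN00_android-11_5480.wav"
)
print(info.age, info.gender, info.recorded_at.isoformat(), info.duration_ms)
# prints: 9 female 2022-05-04T18:57:00 5480
```

Script and speaker IDs are kept as strings so that they can be matched directly against fields 0 and 1 of “transcription(Lower).tsv”.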
Identifier: ELRA-S0496
ISLRN: 369-011-475-593-5
Identifier (URI): https://catalog.elra.info/en-us/repository/browse/ELRA-S0496/
Language: Chinese
Language (ISO639): zho
Medium: Not specified
Publisher: ELRA (European Language Resources Association)
Type (DCMI): Sound
Type (OLAC): primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0496
DateStamp:  2025-07-18
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2025. ELRA (European Language Resources Association).
Terms: dcmi_Sound iso639_zho olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0496
Up-to-date as of: Thu Aug 21 1:01:03 EDT 2025