![]() |
OLAC Record oai:lindat.mff.cuni.cz:11234/1-3023 |
Metadata | ||
Title: | A Speech Test Set of Practice Business Presentations with Additional Relevant Texts | |
Bibliographic Citation: | http://hdl.handle.net/11234/1-3023 | |
Creator: | Macháček, Dominik | |
Kratochvíl, Jonáš | ||
Vojtěchová, Tereza | ||
Bojar, Ondřej | ||
Date (W3CDTF): | 2019-07-15T14:53:51Z | |
Date Available: | 2019-07-15T14:53:51Z | |
Description: | We present a test corpus of audio recordings and transcriptions of presentations of students' enterprises together with their slides and web-pages. The corpus is intended for evaluation of automatic speech recognition (ASR) systems, especially in conditions where the prior availability of in-domain vocabulary and named entities is benefitable. The corpus consists of 39 presentations in English, each up to 90 seconds long, and slides and web-pages in Czech, Slovak, English, German, Romanian, Italian or Spanish. The speakers are high school students from European countries with English as their second language. We benchmark three baseline ASR systems on the corpus and show their imperfection. | |
Identifier (URI): | http://hdl.handle.net/11234/1-3023 | |
Language: | English | |
Language (ISO639): | eng | |
Publisher: | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) | |
Rights: | Creative Commons - Attribution 4.0 International (CC BY 4.0) | |
http://creativecommons.org/licenses/by/4.0/ | ||
Subject: | ASR | |
ASR evaluation | ||
speech corpus | ||
non-native English | ||
speech recognition | ||
speech recognition evaluation | ||
speech and relevant texts | ||
European non-native English | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11234/1-3023 | |
DateStamp: | 2021-06-29 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Macháček, Dominik; Kratochvíl, Jonáš; Vojtěchová, Tereza; Bojar, Ondřej. 2019. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL). | |
Terms: | area_Europe country_GB dcmi_Text iso639_eng olac_primary_text |