OLAC Record oai:www.ldc.upenn.edu:LDC96S35 |
Metadata | ||
Title: | CALLHOME Spanish Speech | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Canavan, Alexandra, and George Zipperlen. CALLHOME Spanish Speech LDC96S35. Web Download. Philadelphia: Linguistic Data Consortium, 1996 | |
Contributor: | Canavan, Alexandra | |
Zipperlen, George | ||
Date (W3CDTF): | 1996 | |
Description: | *Introduction* The CALLHOME Spanish corpus of telephone speech consists of 120 unscripted telephone conversations between native speakers of Spanish. All calls, which lasted up to 30 minutes, originated in North America and were placed to international locations. Most participants called family members or close friends. This corpus contains speech data files ONLY, along with the minimal amount of documentation needed to describe the contents and format of the speech files and the software packages needed to uncompress the speech data. The transcripts and documentation (LDC96T17) are available separately, as is an associated lexicon (LDC96L16). *Samples* Please listen to this audio sample (SPH). *Updates* The "shorten" and "sphere" directories have been removed. The sphere directory contained NIST "SPeech HEader REsources" (SPHERE): C-language source code libraries and utilities for manipulating NIST SPHERE-format waveform files. The shorten directory contained files for Tony Robinson's "shorten" software for speech compression. A more recent version of the SPHERE utilities is now available on the NIST web site; additional utilities for converting from SPHERE to other waveform file formats is also available at the LDC web site. 10.10.2003: It has been brought to our attention that 16 sphere files (both from the train and devtest directories) were corrupted; the problem becomes apparent when trying to decompress the files using the w_decode utility. As of June 12th, 2018, the corrected version of these files are included with the downloadable corpus. Any new downloads after this date will contain the full, corrected speech. | |
Format: | Sampling Rate: 8000 | |
Sampling Format: 2-channel ulaw | ||
Identifier: | LDC96S35 | |
https://catalog.ldc.upenn.edu/LDC96S35 | ||
ISBN: 1-58563-083-7 | ||
ISLRN: 321-477-528-167-2 | ||
DOI: 10.35111/2skn-2002 | ||
Language: | Spanish | |
Language (ISO639): | spa | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC96S35 | |
Rights Holder: | Portions © 1996 Trustees of the University of Pennsylvania | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC96S35 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Canavan, Alexandra; Zipperlen, George. 1996. Linguistic Data Consortium. | |
Terms: | area_Europe country_ES dcmi_Sound iso639_spa olac_primary_text |