OLAC Record: Asian Elephant Vocalizations

OLAC Record
oai:www.ldc.upenn.edu:LDC2010S05

Metadata

Title: Asian Elephant Vocalizations

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Shermin de Silva. Asian Elephant Vocalizations LDC2010S05. Web Download. Philadelphia: Linguistic Data Consortium, 2010

Contributor: Shermin de Silva

Date (W3CDTF): 2010

Date Issued (W3CDTF): 2010-08-18

Description: *Introduction* Asian Elephant Vocalizations consists of 57.5 hours of audio recordings of vocalizations by Asian Elephants (Elephas maximus) in the Uda Walawe National Park, Sri Lanka, of which 31.25 hours have been annotated. Voice recording field notes were made by Shermin de Silva and Ashoka Ranjeewa, of the Uda Walawe Elephant Research Project. The collection and annotation of the recordings was conducted and overseen by Shermin de Silva, through the University of Pennsylvania Department of Biology, and Institute for Research in Cognitive Science. The recordings primarily feature adult female, and juvenile elephants. Existing knowledge of acoustic communication in elephants is based mostly on African species (Loxodonta africana and Loxodonta cyclotis). There has been comparatively less study of communication in Asian elephants, primarily becaUse the habitat in which Asian elephants typically live makes them more difficult to study than African forest elephants. For other current elephant vocalization research, see ElephantVoices and the Cornell Lab of Ornithologys Elephant Listening Project. This corpus is intended to enable researchers in acoustic communication to evaluate acoustic features and repertoire diversity of the recorded population. Of particular interest is whether there may be regional dialects that differ among Asian elephant populations in the wild and in captivity. A second interest is in whether structural commonalities exist between this and other species that shed light on underlying social and ecological factors shaping communication systems. *Methods* *Study site and subjects* Uda Walawe National Park (UWNP), Sri Lanka, is located at latitude 630°14.0646N, longitude 80°5428.1268E, and an average altitude of 118 m above sea level. It occupies 308 km2 and contains tall grassland, dense scrub, riparian forest, secondary forest, rivers and seasonal streams. It also contains several natural and man-made water sources and reservoirs with seasonal floodplains. There are two monsoons per calendar year, separated by dry seasons of variable length. Over 300 adult females have been individually identified in UWNP using characteristics of the ears, tail, and other natural markings (Moss, 1996). *Data collection* Data were collected from May, 2006 to December, 2007. Observations were performed by vehicle during park hours from 0600 to 1830 h. Most recordings of vocalizations were made using an Earthworks QTC50 microphone shock-mounted inside a Rycote Zeppelin windshield, via a Fostex FR-2 field recorder (24-bit sample size, sampling rate 48 kHz) connected to a 12 V lead acid battery. Recordings were initiated at the start of a call with a 10-s pre-record buffer so that the entire call was captured and loss of rare vocalizations minimized. This was made possible with the pre-record feature of the Fostex, which records continuously, but only saves the file with a 10-second lead once the record button is depressed. In order to minimize loss of low-frequency or potentially inaudible calls, recording was continued for at least three minutes following the end of vocalization events. During the first two months, hour-long recording sessions were also carried out opportunistically while in close proximity to a group. However, spectrograms showed that few vocalizations were captured therefore, this was discontinued. *Anomalies* Some audio files have 1 channel (field recording) and some have 2 channels (field recordings and field notes). Certain files were recorded at 22050 Hz sample rate: * asian_elephant_voc_d1/data/20070209/B13h00m34s09feb2007y.flac * asian_elephant_voc_d1/data/20070209/B13h10m04s09feb2007y.flac * asian_elephant_voc_d2/data/20070405/B14h56m48s05apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h35m11s09apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h38m34s09apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h39m27s09apr2007y.flac Certain files were recorded at 16 bits per sample: * asian_elephant_voc_d1/data/20070209/B13h00m34s09feb2007y.flac * asian_elephant_voc_d1/data/20070209/B13h10m04s09feb2007y.flac * asian_elephant_voc_d2/data/20070405/B14h56m48s05apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h35m11s09apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h38m34s09apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h39m27s09apr2007y.flac * asian_elephant_voc_d3/data/20070507/B08h37m21s07may2007y.flac * asian_elephant_voc_d4/data/20070822/B08h44m02s22aug2007y.flac * asian_elephant_voc_d4/data/20070822/B08h48m02s22aug2007y.flac * asian_elephant_voc_d4/data/20071015/B12h25m22s15oct2007y.flac * asian_elephant_voc_d4/data/20071015/B12h59m51s15oct2007y.flac * asian_elephant_voc_d5/data/20071024/B16h12m29s24oct2007y.flac One file contains audio extracted from a video recording at 16-bit, 32 kHz. This file may overlap with other audio recordings, but was used to aid annotation because of the density of vocalizations and the number of vocalizing individuals: * asian_elephant_voc_d1/data_from_video/20070724/20070724_g01_vocs.flac *Audio data annotation* Certain audio files were manually annotated, to the extent possible, with call type (see below for a list of categories), caller id, and miscellaneous notes. Annotations were made using the Praat TextGrid Editor, which allows spectral analysis and annotation of audio files with overlapping events. Annotations were based on written and audio-recorded field notes, and in some cases video recordings. Miscellaneous notes are free-form, and include such information as distance from source, caller identity certainty, and accompanying behavior. Audio files that are included without a corresponding Praat TextGrid annotation file have not yet been annotated. *Acoustic features* There are three main categories of vocalizations: those that show clear fundamental frequencies (periodic), those that do not (a-periodic), and those that show periodic and a-periodic regions as at least two distinct segments. Calls were identified as belonging to one of 14 categories: Call Type Abbreviation Growl GRW Squeak SQK Longroar-rumble LRM Longroar LRR Rumble RUM Bark-rumble BRM Trumpet TMP Roar-rumble RRM Roar ROR Bark BRK Squeal SQL Croak-rumble CRM1 Chirp-rumble CRM2 Musth chirp-rumble MCR *Audio compression (FLAC)* All audio wav files in this corpus have been compressed using FLAC (Free Lossless Audio Codec). Becuase FLAC is a lossless compression algorithm, the conversion of the included FLAC files into wav files will result in files that are sample-for-sample identical to the original wav file recordings. Many standard audio tools (including Praat TextGrid Editor) will transparently decompress FLAC files, so that they may be played, processed, and examined as if they were uncompressed audio. Should you wish to explicitly decompress FLAC files (by converting them into wav files), there are many free audio tools capable of performing this conversion. Some such tools, available for all major operating systems, may be found at http://flac.sourceforge.net/download.html The data in this corpus were used by the corpus author as the foundation of a paper, Acoustic communication in the Asian elephant, Elephas maximus maximus (S. de Silva Behaviour, Volume 147, Number 7, 2010, pp. 825-852). If you have trouble accessing the paper through the preceding link, you may contact the corpus author directly for assistance. *Sample* A sample of data available in this corpus: Audio recording Praat TextGrid Annotation *Updates* No updates are available at this time.

Extent: Corpus size: 22764960 KB

Format: Sampling Rate: 48000

Sampling Format: 24 bit PCM

Identifier: LDC2010S05

https://catalog.ldc.upenn.edu/LDC2010S05

ISBN: 1-58563-557-X

ISLRN: 034-349-578-913-7

DOI: 10.35111/7sj2-2437

Language: No linguistic content, Not applicable

Language (ISO639): zxx

License: LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Rights Holder: Portions © 2010 Trustees of the University of Pennsylvania.

Type (DCMI): Sound

Type (OLAC): primary_text

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC2010S05

DateStamp: 2025-09-30

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: Shermin de Silva. 2010. Linguistic Data Consortium.
Terms: dcmi_Sound iso639_zxx olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2010S05
Up-to-date as of: Wed Oct 29 7:01:12 EDT 2025

Metadata
Title:		Asian Elephant Vocalizations
Access Rights:		Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:		Shermin de Silva. Asian Elephant Vocalizations LDC2010S05. Web Download. Philadelphia: Linguistic Data Consortium, 2010
Contributor:		Shermin de Silva
Date (W3CDTF):		2010
Date Issued (W3CDTF):		2010-08-18
Description:		Introduction Asian Elephant Vocalizations consists of 57.5 hours of audio recordings of vocalizations by Asian Elephants (Elephas maximus) in the Uda Walawe National Park, Sri Lanka, of which 31.25 hours have been annotated. Voice recording field notes were made by Shermin de Silva and Ashoka Ranjeewa, of the Uda Walawe Elephant Research Project. The collection and annotation of the recordings was conducted and overseen by Shermin de Silva, through the University of Pennsylvania Department of Biology, and Institute for Research in Cognitive Science. The recordings primarily feature adult female, and juvenile elephants. Existing knowledge of acoustic communication in elephants is based mostly on African species (Loxodonta africana and Loxodonta cyclotis). There has been comparatively less study of communication in Asian elephants, primarily becaUse the habitat in which Asian elephants typically live makes them more difficult to study than African forest elephants. For other current elephant vocalization research, see ElephantVoices and the Cornell Lab of Ornithologys Elephant Listening Project. This corpus is intended to enable researchers in acoustic communication to evaluate acoustic features and repertoire diversity of the recorded population. Of particular interest is whether there may be regional dialects that differ among Asian elephant populations in the wild and in captivity. A second interest is in whether structural commonalities exist between this and other species that shed light on underlying social and ecological factors shaping communication systems. Methods Study site and subjects Uda Walawe National Park (UWNP), Sri Lanka, is located at latitude 630°14.0646N, longitude 80°5428.1268E, and an average altitude of 118 m above sea level. It occupies 308 km2 and contains tall grassland, dense scrub, riparian forest, secondary forest, rivers and seasonal streams. It also contains several natural and man-made water sources and reservoirs with seasonal floodplains. There are two monsoons per calendar year, separated by dry seasons of variable length. Over 300 adult females have been individually identified in UWNP using characteristics of the ears, tail, and other natural markings (Moss, 1996). Data collection Data were collected from May, 2006 to December, 2007. Observations were performed by vehicle during park hours from 0600 to 1830 h. Most recordings of vocalizations were made using an Earthworks QTC50 microphone shock-mounted inside a Rycote Zeppelin windshield, via a Fostex FR-2 field recorder (24-bit sample size, sampling rate 48 kHz) connected to a 12 V lead acid battery. Recordings were initiated at the start of a call with a 10-s pre-record buffer so that the entire call was captured and loss of rare vocalizations minimized. This was made possible with the pre-record feature of the Fostex, which records continuously, but only saves the file with a 10-second lead once the record button is depressed. In order to minimize loss of low-frequency or potentially inaudible calls, recording was continued for at least three minutes following the end of vocalization events. During the first two months, hour-long recording sessions were also carried out opportunistically while in close proximity to a group. However, spectrograms showed that few vocalizations were captured therefore, this was discontinued. Anomalies Some audio files have 1 channel (field recording) and some have 2 channels (field recordings and field notes). Certain files were recorded at 22050 Hz sample rate: * asian_elephant_voc_d1/data/20070209/B13h00m34s09feb2007y.flac * asian_elephant_voc_d1/data/20070209/B13h10m04s09feb2007y.flac * asian_elephant_voc_d2/data/20070405/B14h56m48s05apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h35m11s09apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h38m34s09apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h39m27s09apr2007y.flac Certain files were recorded at 16 bits per sample: * asian_elephant_voc_d1/data/20070209/B13h00m34s09feb2007y.flac * asian_elephant_voc_d1/data/20070209/B13h10m04s09feb2007y.flac * asian_elephant_voc_d2/data/20070405/B14h56m48s05apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h35m11s09apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h38m34s09apr2007y.flac * asian_elephant_voc_d2/data/20070409/B14h39m27s09apr2007y.flac * asian_elephant_voc_d3/data/20070507/B08h37m21s07may2007y.flac * asian_elephant_voc_d4/data/20070822/B08h44m02s22aug2007y.flac * asian_elephant_voc_d4/data/20070822/B08h48m02s22aug2007y.flac * asian_elephant_voc_d4/data/20071015/B12h25m22s15oct2007y.flac * asian_elephant_voc_d4/data/20071015/B12h59m51s15oct2007y.flac * asian_elephant_voc_d5/data/20071024/B16h12m29s24oct2007y.flac One file contains audio extracted from a video recording at 16-bit, 32 kHz. This file may overlap with other audio recordings, but was used to aid annotation because of the density of vocalizations and the number of vocalizing individuals: * asian_elephant_voc_d1/data_from_video/20070724/20070724_g01_vocs.flac Audio data annotation Certain audio files were manually annotated, to the extent possible, with call type (see below for a list of categories), caller id, and miscellaneous notes. Annotations were made using the Praat TextGrid Editor, which allows spectral analysis and annotation of audio files with overlapping events. Annotations were based on written and audio-recorded field notes, and in some cases video recordings. Miscellaneous notes are free-form, and include such information as distance from source, caller identity certainty, and accompanying behavior. Audio files that are included without a corresponding Praat TextGrid annotation file have not yet been annotated. Acoustic features There are three main categories of vocalizations: those that show clear fundamental frequencies (periodic), those that do not (a-periodic), and those that show periodic and a-periodic regions as at least two distinct segments. Calls were identified as belonging to one of 14 categories: Call Type Abbreviation Growl GRW Squeak SQK Longroar-rumble LRM Longroar LRR Rumble RUM Bark-rumble BRM Trumpet TMP Roar-rumble RRM Roar ROR Bark BRK Squeal SQL Croak-rumble CRM1 Chirp-rumble CRM2 Musth chirp-rumble MCR Audio compression (FLAC) All audio wav files in this corpus have been compressed using FLAC (Free Lossless Audio Codec). Becuase FLAC is a lossless compression algorithm, the conversion of the included FLAC files into wav files will result in files that are sample-for-sample identical to the original wav file recordings. Many standard audio tools (including Praat TextGrid Editor) will transparently decompress FLAC files, so that they may be played, processed, and examined as if they were uncompressed audio. Should you wish to explicitly decompress FLAC files (by converting them into wav files), there are many free audio tools capable of performing this conversion. Some such tools, available for all major operating systems, may be found at http://flac.sourceforge.net/download.html The data in this corpus were used by the corpus author as the foundation of a paper, Acoustic communication in the Asian elephant, Elephas maximus maximus (S. de Silva Behaviour, Volume 147, Number 7, 2010, pp. 825-852). If you have trouble accessing the paper through the preceding link, you may contact the corpus author directly for assistance. Sample A sample of data available in this corpus: Audio recording Praat TextGrid Annotation Updates No updates are available at this time.
Extent:		Corpus size: 22764960 KB
Format:		Sampling Rate: 48000
Format:		Sampling Format: 24 bit PCM
Identifier:		LDC2010S05
		https://catalog.ldc.upenn.edu/LDC2010S05
		ISBN: 1-58563-557-X
		ISLRN: 034-349-578-913-7
		DOI: 10.35111/7sj2-2437
Language:		No linguistic content, Not applicable
Language (ISO639):		zxx
License:		LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:		Distribution: Web Download
Publisher:		Linguistic Data Consortium
Publisher (URI):		https://www.ldc.upenn.edu
Rights Holder:		Portions © 2010 Trustees of the University of Pennsylvania.
Type (DCMI):		Sound
Type (OLAC):		primary_text
OLAC Info
Archive:		The LDC Corpus Catalog
Description:		http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:www.ldc.upenn.edu:LDC2010S05
DateStamp:		2025-09-30
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		Shermin de Silva. 2010. Linguistic Data Consortium.
Terms:		dcmi_Sound iso639_zxx olac_primary_text