OLAC Record
oai:www.ldc.upenn.edu:LDC2025L01

Metadata
Title:Iraqi Arabic - English Lexical Database
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Maamouri, Mohamed, and David Graff. Iraqi Arabic - English Lexical Database LDC2025L01. Web Download. Philadelphia: Linguistic Data Consortium, 2025
Contributor:Maamouri, Mohamed
Graff, David
Date (W3CDTF):2025
Date Issued (W3CDTF):2025-01-15
Description:Iraqi Arabic - English Lexical Database was developed by the Linguistic Data Consortium (LDC). It contains six interrelated tables presenting over 67,000 Iraqi Arabic words as orthographic forms in Arabic script and pronunciation forms in International Phonectic Alphabetic (IPA) format, along with more than 120,000 English tokens. This release is the result of a collaboration with Georgetown University Press to enhance and update three dialectal Arabic dictionaries -- Iraqi, Moroccan and Syrian -- originally published in the 1960s. The Georgetown Dictionary of Iraqi Arabic was published in 2013. That work was based on, and expanded, two dictionaries, A Dictionary of Iraqi Arabic: English-Arabic (Clarity, Stowasser and Wolfe, eds., 2003) and A Dictionary of Iraqi Arabic: Arabic-English (Woodhead and Beene, eds., 2003). The several enhancements developed by LDC in the updated and enhanced dictionary and the lexical database included facilitating comparisons across Arabic dialects and Modern Standard Arabic by providing Arabic script spellings and IPA pronunciations to Iraqi words and phrases; promoting ease of use by language learners and researchers by developing reasonable orthographic conventions for applying the Arabic alphabet to the dialect; and facilitating a user's understanding of morphological and lexical relations by adding information on the linguistic structures of Iraqi Arabic. *Data* The number of entries in each table is as follows: Roots 4,512 Lemmas 17,224 Wordforms 22,988 Multi-word Expressions 261 Definitions 23,834 Phrases 15,714 Each table is presented as a UTF-8 encoded tab-delimited file with Unix-style (line-feed only) line breaks. The documentation accompanying this release includes instructions for combining into one database the tables in this corpus with the tables in Moroccan Arabic - English Lexical Database LDC2023L01. *Acknowledgments* This work was supported by the U.S. Department of Education International Research Studies Program (#P017A0800441) with additional support from GUP and LDC. *Samples* Please view these samples: * Roots * Lemmas * Wordforms * Multi-word expressions * Definitions * Phrases *Updates* None at this time.
Extent:Corpus size: 3180 KB
Identifier:LDC2025L01
https://catalog.ldc.upenn.edu/LDC2025L01
ISLRN: 362-004-101-706-6
DOI: 10.35111/7fr9-g791
Language:Mesopotamian Arabic
English
Language (ISO639):acm
eng
License:Iraqi Arabic - English Lexical Database Agreement: https://catalog.ldc.upenn.edu/license/iraqi-arabic-english-lexical-database-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2025L01
Rights Holder:Portions © 2025 Georgetown University Press, © 2025 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):lexicon

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2025L01
DateStamp:  2025-01-15
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Maamouri, Mohamed; Graff, David. 2025. Linguistic Data Consortium.
Terms: area_Asia area_Europe country_GB country_IQ dcmi_Text iso639_acm iso639_eng olac_lexicon


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2025L01
Up-to-date as of: Thu Jan 16 8:07:53 EST 2025