OLAC Record oai:www.ldc.upenn.edu:LDC2023T10 |
| Metadata | ||
| Title: | AIDA Scenario 1 and 2 Reference Knowledge Base | |
| Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
| Bibliographic Citation: | Tracey, Jennifer, et al. AIDA Scenario 1 and 2 Reference Knowledge Base LDC2023T10. Web Download. Philadelphia: Linguistic Data Consortium, 2023 | |
| Contributor: | Tracey, Jennifer | |
| Strassel, Stephanie | ||
| Getman, Jeremy | ||
| Bies, Ann | ||
| Griffitt, Kira | ||
| Graff, David | ||
| Caruso, Christopher | ||
| Date (W3CDTF): | 2023 | |
| Date Issued (W3CDTF): | 2023-10-16 | |
| Description: | *Introduction* AIDA Scenario 1 and 2 Reference Knowledge Base was developed by the Linguistic Data Consortium (LDC) and contains the English knowledge base (KB) used for all AIDA entity linking annotation in Scenario 1 (Russia-Ukraine Relations) and Scenario 2 (Crisis in Venezuela). The KB content was drawn from GeoNames, the CIA World Leaders List and the CIA World Factbook and was supplemented with manually created KB entries developed specifically for AIDA data. The DARPA AIDA (Active Interpretation of Disparate Alternatives) program aimed to develop a multi-hypothesis semantic engine to generate explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supported AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages. Each phase of the AIDA program focused on a specific scenario, or broad topic area, with related subtopics designated as either practice subtopics or evaluation subtopics. The Phase 1 scenario focused on political relations between Russia and Ukraine in the 2010s. The Phase 2 scenario focused on the socioeconomic and political crisis in Venezuela since 2010. *Data* This knowledge base supported the AIDA entity detection and linking task for 13 entity types: GPE (Geo-Political Entity), LOC (Location), PER (Person), ORG (Organization), FAC (Facility), MHI (Medical/Health Issue), WEA (Weapon), SID (Side), COM (Commodity), CRM (Crime), LAW (Law), VEH (Vehicle), and BAL (Ballot). There are four inputs to the KB: GPE and LOC entities from GeoNames (GEO), PER entities from the CIA World Leaders List (WLL), ORG entities from Appendix B of the CIA World Factbook (APB), and additional entities manually created by LDC. The GEO, WLL and APB entries are also found in LORELEI Entity Detection and Linking Knowledge Base (LDC2020T10). 
*Acknowledgement* This material is based upon work supported by Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C-0013. *Samples* Please view the following samples: * Alternate Names Sample * Entities Sample * Member States Sample *Updates* None at this time. | |
| Extent: | Corpus size: 805034 KB | |
| Identifier: | LDC2023T10 | |
| https://catalog.ldc.upenn.edu/LDC2023T10 | ||
| ISLRN: 644-411-403-964-6 | ||
| DOI: 10.35111/3wzr-h616 | ||
| Language: | English | |
| Language (ISO639): | eng | |
| License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
| Medium: | Distribution: Web Download | |
| Publisher: | Linguistic Data Consortium | |
| Publisher (URI): | https://www.ldc.upenn.edu | |
| Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2023T10 | |
| Rights Holder: | Portions © 2023 Trustees of the University of Pennsylvania | |
| Type (DCMI): | Text | |
| Type (OLAC): | primary_text | |
OLAC Info |
||
| Archive: | The LDC Corpus Catalog | |
| Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
| GetRecord: | OAI-PMH request for OLAC format | |
| GetRecord: | Pre-generated XML file | |
OAI Info |
||
| OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2023T10 | |
| DateStamp: | 2024-01-01 | |
| GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
| Citation: | Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher. 2023. Linguistic Data Consortium. | |
| Terms: | area_Europe country_GB dcmi_Text iso639_eng olac_primary_text | |