OLAC Record oai:lindat.mff.cuni.cz:11372/LRT-1661 |
Metadata | ||
Title: | The ACL RD-TEC 2.0 | |
Bibliographic Citation: | http://hdl.handle.net/11372/LRT-1661 | |
Creator: | QasemiZadeh, Behrang | |
Schumann, Anne-Kathrin | ||
Date (W3CDTF): | 2016-03-07T17:34:32Z | |
Date Available: | 2016-03-07T17:34:32Z | |
Description: | The ACL RD-TEC 2.0 has been developed to provide a benchmark for evaluating methods for terminology extraction and classification, as well as entity recognition tasks, based on specialised text from the computational linguistics domain. This release of the corpus consists of 300 abstracts from articles in the ACL Anthology Reference Corpus, published between 1978 and 2006. In these abstracts, terms (i.e., single or multi-word lexical units with a specialised meaning) are manually annotated. In addition to marking their boundaries in running text, annotated terms are classified into one of seven categories: method, tool, language resource (LR), LR product, model, measures and measurements, and other. To assess the quality of the annotations and to determine the difficulty of this task, more than 171 of the abstracts are annotated twice, independently, by each of the two annotators. In total, 6,818 terms are identified and annotated, resulting in a specialised vocabulary of 3,318 lexical forms, mapped to 3,471 concepts. | |
Identifier (URI): | http://hdl.handle.net/11372/LRT-1661 | |
Language: | English | |
Language (ISO639): | eng | |
Publisher: | DFG Collaborative Research Centre 991, University of Duesseldorf | |
Department of Applied Linguistics, Translation and Interpreting, Saarland University | ||
Rights: | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) | |
http://creativecommons.org/licenses/by-nc-sa/4.0/ | ||
Subject: | Terminology | |
Term Extraction | ||
Term Classification | ||
Entity Recognition | ||
Evaluation Corpus | ||
Language Resource | ||
Gold Dataset | ||
Evaluation of Automatic Terminology Construction Methods | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info | ||
Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info | ||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11372/LRT-1661 | |
DateStamp: | 2021-06-29 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | QasemiZadeh, Behrang; Schumann, Anne-Kathrin. 2016. DFG Collaborative Research Centre 991, University of Duesseldorf. | |
Terms: | area_Europe country_GB dcmi_Text iso639_eng olac_primary_text |
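
The GetRecord entries above refer to OAI-PMH requests for this record in two metadata formats. As a rough sketch of how such a request URL is assembled, the snippet below uses the standard OAI-PMH query parameters (`verb`, `identifier`, `metadataPrefix`) together with the OaiIdentifier from this record. The base URL is a placeholder assumption, since the record does not state the repository endpoint, and `olac` is assumed to be the prefix under which the OLAC format is exposed; `oai_dc` is the simple Dublin Core prefix defined by the protocol. The helper name `build_get_record_url` is hypothetical.

```python
from urllib.parse import urlencode


def build_get_record_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Assemble an OAI-PMH GetRecord request URL (hypothetical helper)."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# Placeholder endpoint: an assumption, not the repository's real base URL.
BASE_URL = "https://example.org/oai"

# OaiIdentifier taken from the record above.
OAI_IDENTIFIER = "oai:lindat.mff.cuni.cz:11372/LRT-1661"

# "olac" is assumed to be the prefix for the OLAC format.
olac_url = build_get_record_url(BASE_URL, OAI_IDENTIFIER, "olac")

# "oai_dc" is the simple Dublin Core prefix defined by OAI-PMH.
dc_url = build_get_record_url(BASE_URL, OAI_IDENTIFIER, "oai_dc")

print(olac_url)
print(dc_url)
```

Replacing `BASE_URL` with the archive's actual OAI-PMH endpoint would be needed before either URL returns this record.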