OLAC Record oai:catalogue.elra.info:ELRA-W0080 |
Metadata | ||
Title: | NE3L named entities Russian corpus | |
Access Rights: | Rights available for: nonCommercialUse, commercialUse | |
Date Available (W3CDTF): | 2014-09-29 | |
Date Issued (W3CDTF): | 2014-09-29 | |
Date Modified (W3CDTF): | 2014-09-29 | |
Description: | The NE3L project (Named Entities 3 Languages) consisted in annotating several corpora with different languages with named entities. Text format data were extracted from newspapers and deal with various topics. 3 different languages were annotated: Arabic, Chinese and Russian. For this project, 5 named entity categories were taken into account: Person, Place, Organisation, Time and Amount. Each language was concerned only by a subset of these categories, i.e. Arabic was marked up with Time and Amount tags, as well as Russian, whereas Chinese was marked up with Person, Place and Organisation tags.The Russian corpus contains 75,784 words coming from articles extracted from “Izvestia” newspaper, and published in 1995. | |
Identifier: | ELRA-W0080 | |
ISLRN: 024-620-556-146-2 | ||
Identifier (URI): | https://catalog.elra.info/en-us/repository/browse/ELRA-W0080/ | |
Language: | Russian | |
Language (ISO639): | rus | |
Medium: | Not specified | |
Publisher: | ELRA (European Language Resources Association) | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | ELRA Catalogue of Language Resources | |
Description: | http://www.language-archives.org/archive/catalogue.elra.info | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:catalogue.elra.info:ELRA-W0080 | |
DateStamp: | 2014-09-29 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | n.a. 2014. ELRA (European Language Resources Association). | |
Terms: | area_Europe country_RU dcmi_Text iso639_rus olac_primary_text |