OLAC Record oai:catalogue.elra.info:ELRA-W0232 |
Metadata | ||
Title: | Maltese-English website parallel corpus (Processed) | |
Access Rights: | Rights available for: other | |
Date Available (W3CDTF): | 2020-02-27 | |
Date Issued (W3CDTF): | 2020-02-27 | |
Date Modified (W3CDTF): | 2018-09-30 | |
Description: | This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 26,622 translation units (TUs). Date of crawling: 16/12/2016. A strict validation process has been followed, which resulted in discarding: TUs from crawled websites that do not comply with the PSI directive; TUs identified during the manual validation process; and all the TUs from websites whose error rate in the sample extracted for manual validation is strictly above the following thresholds: 50% of TUs with language identification errors, 50% of TUs with alignment errors, 50% of TUs with tokenization errors, 20% of TUs identified as machine translated content, 50% of TUs with translation errors. | |
Identifier: | ELRA-W0232 | |
ISLRN: 693-091-524-649-2 | ||
Identifier (URI): | https://catalog.elra.info/en-us/repository/browse/ELRA-W0232/ | |
Language: | English | |
Maltese | ||
Language (ISO639): | eng | |
mlt | ||
Medium: | downloadable | |
Publisher: | ELRA (European Language Resources Association) | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | ELRA Catalogue of Language Resources | |
Description: | http://www.language-archives.org/archive/catalogue.elra.info | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:catalogue.elra.info:ELRA-W0232 | |
DateStamp: | 2020-02-27 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | n.a. 2020. ELRA (European Language Resources Association). | |
Terms: | area_Europe country_GB country_MT dcmi_Text iso639_eng iso639_mlt olac_primary_text |