OLAC Record oai:lindat.mff.cuni.cz:11372/LRT-4763 |
Metadata | ||
Title: | Manual Arabic spelling-errors correction for collected documents | |
Bibliographic Citation: | http://hdl.handle.net/11372/LRT-4763 | |
Creator: | Saty, Ahmed | |
Aouragh, Si Lhoussain | ||
Bouzoubaa, Karim | ||
Date (W3CDTF): | 2023-05-09T09:27:45Z | |
Date Available: | 2023-05-09T09:27:45Z | |
Description: | This file is a text corpus for Arabic spell checking: a group of people edited different files, and every spelling error they committed was recorded. The participants cover a broad range of profiles: male, female, old-aged, middle-aged, young-aged, high and low computer usage users, etc. Through this work, we aim to help researchers and others interested in Arabic NLP by providing an Arabic spell-check corpus that is ready and open to exploitation and interpretation. This study also made it possible to inventory most of the spelling mistakes made by editors of Arabic texts. The file contains the following sections (tags): people, documents they printed, types of possible errors, and errors they made. Each section (tag) contains data explaining its details and content, which helps researchers extract research-oriented results. The people section contains basic information about each person and their computer usage, while the documents section lists all sentences in each document, numbering each sentence so that the errors section can refer to it. We also include a “type of errors” section, which lists all possible errors with a description in Arabic and an illustrative example for each. | |
Identifier (URI): | http://hdl.handle.net/11372/LRT-4763 | |
Language: | English | |
Arabic | ||
Language (ISO639): | eng | |
ara | ||
Publisher: | Sudan University of Science and Technology | |
Rights: | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) | |
http://creativecommons.org/licenses/by-sa/4.0/ | ||
Subject: | Manual Arabic spelling-errors correction for collected documents | |
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11372/LRT-4763 | |
DateStamp: | 2023-05-09 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Saty, Ahmed; Aouragh, Si Lhoussain; Bouzoubaa, Karim. 2023. Sudan University of Science and Technology. | |
Terms: | area_Europe country_GB dcmi_Text iso639_ara iso639_eng olac_primary_text |