Arabic optical character recognition software: A review



Abstract

This paper provides a thorough evaluation of six important Arabic OCR systems available on the market, namely Abbyy FineReader, Leadtools, Readiris, Sakhr, Tesseract, and NovoVerus. We test the OCR systems using 250 randomly selected images from the well-known Arabic Printed Text Image (APTI) database and a set of 8 images from an Arabic book. The APTI database contains 45,313,600 word images, both decomposable and non-decomposable. The evaluation comprises two tests. The first is based on the usual metrics used in the literature. In the second, we provide a novel measure for the Arabic language, which can also be used for other non-Latin languages.

About the authors

Faisal Alkhateeb

Computer Sciences Department

Corresponding author.
Email: alkhateebf@yu.edu.jo
Jordan, Irbid, 21163

Iyad Abu Doush

Computer Sciences Department; Computer Science Department

Email: alkhateebf@yu.edu.jo
Jordan, Irbid, 21163; The State of Kuwait

Abdelraoaf Albsoul

Computer Sciences Department

Email: alkhateebf@yu.edu.jo
Jordan, Irbid, 21163

© Pleiades Publishing, Ltd., 2017