Arabic optical character recognition software: A review
- Авторы: Alkhateeb F.1, Abu Doush I.1,2, Albsoul A.1
-
Учреждения:
- Computer Sciences Department
- Computer Science Department
- Выпуск: Том 27, № 4 (2017)
- Страницы: 763-776
- Раздел: Software and Hardware for Pattern Recognition and Image Analysis
- URL: https://journals.rcsi.science/1054-6618/article/view/195253
- DOI: https://doi.org/10.1134/S105466181704006X
- ID: 195253
Цитировать
Аннотация
This paper provides a thorough evaluation of a set of six important Arabic OCR systems available in the market; namely: Abbyy FineReader, Leadtools, Readiris, Sakhr, Tesseract and NovoVerus. We test the OCR systems using a randomly selected images from the well known Arabic Printed Text Image database (250 images from the APTI database) and using a set of 8 images from an Arabic book. The APTI database contains 45.313.600 of both decomposable and non-decomposable word images. In the evaluation, we conduct two tests. The first test is based on usual metrics used in the literature. In the second test, we provide a novel measure for Arabic language, which can be used for other non-Latin languages.
Ключевые слова
Об авторах
Faisal Alkhateeb
Computer Sciences Department
Автор, ответственный за переписку.
Email: alkhateebf@yu.edu.jo
Иордания, Irbid, 21163
Iyad Abu Doush
Computer Sciences Department; Computer Science Department
Email: alkhateebf@yu.edu.jo
Иордания, Irbid, 21163; The State of Kuwait
Abdelraoaf Albsoul
Computer Sciences Department
Email: alkhateebf@yu.edu.jo
Иордания, Irbid, 21163
Дополнительные файлы
