Kader Smail | University of Maryland, College Park (original) (raw)

Kader  Smail

Address: 1102 Francis Scott Key Hall, University of Maryland College Park, MD 20742

less

Related Authors

Valerie Gonzalez

Daniella Talmon-Heller

Benjamin  Kedar

Daniella Talmon-Heller

Richard McGregor

Hagit Nol

Hagit Nol

Goethe-Universität Frankfurt am Main

Stephennie Mulder

Adam Bursi

Uploads

Edited Journals by Kader Smail

Research paper thumbnail of Al-ʿUsur al-Wusta: The Journal of Middle East Medievalists, Volume 29 (2021). (Open access: https://journals.library.columbia.edu/index.php/alusur/issue/view/774)

by Antoine Borrut, Luke Yarbrough, Kader Smail, Liana Saif, Gohar Grigoryan, Michael Pregill, Aurélien Montel, Alberto Bardi, Javier Albarrán, and Sarah Slingluff, PhD

Papers by Kader Smail

Research paper thumbnail of Daniella Talmon-Heller. Sacred Place and Sacred Time in the Medieval Islamic Middle East: A Historical Perspective

Research paper thumbnail of Advances and Limitations in Open Source Arabic-Script OCR: A Case Study

Digital Studies/le champ numérique (DSCN) Open Issue 2021, 2021

This work presents an accuracy study of the open source OCR engine, Kraken, on the leading Arabic... more This work presents an accuracy study of the open source OCR engine, Kraken, on the leading Arabic scholarly journal, al-Abhath. In contrast with other commercially available OCR engines, Kraken is shown to be capable of producing highly accurate Arabic-script OCR. The study also assesses the relative accuracy of typeface-specific and generalized models on the al-Abhath data and provides a microanalysis of the “error instances” and the contextual features that may have contributed to OCR misrecognition. Building on this analysis, the paper argues that Arabic-script OCR can be significantly improved through (1) a more systematic approach to training data production, and (2) the development of key technological components, especially multi-language models and improved line segmentation and layout analysis./Cet article présente une étude d’exactitude du moteur ROC open source, Krakan, sur la revue académique arabe de premier rang, al-Abhath. Contrairement à d’autres moteurs ROC disponib...

Research paper thumbnail of Al-ʿUsur al-Wusta: The Journal of Middle East Medievalists, Volume 29 (2021). (Open access: https://journals.library.columbia.edu/index.php/alusur/issue/view/774)

by Antoine Borrut, Luke Yarbrough, Kader Smail, Liana Saif, Gohar Grigoryan, Michael Pregill, Aurélien Montel, Alberto Bardi, Javier Albarrán, and Sarah Slingluff, PhD

Research paper thumbnail of Daniella Talmon-Heller. Sacred Place and Sacred Time in the Medieval Islamic Middle East: A Historical Perspective

Research paper thumbnail of Advances and Limitations in Open Source Arabic-Script OCR: A Case Study

Digital Studies/le champ numérique (DSCN) Open Issue 2021, 2021

This work presents an accuracy study of the open source OCR engine, Kraken, on the leading Arabic... more This work presents an accuracy study of the open source OCR engine, Kraken, on the leading Arabic scholarly journal, al-Abhath. In contrast with other commercially available OCR engines, Kraken is shown to be capable of producing highly accurate Arabic-script OCR. The study also assesses the relative accuracy of typeface-specific and generalized models on the al-Abhath data and provides a microanalysis of the “error instances” and the contextual features that may have contributed to OCR misrecognition. Building on this analysis, the paper argues that Arabic-script OCR can be significantly improved through (1) a more systematic approach to training data production, and (2) the development of key technological components, especially multi-language models and improved line segmentation and layout analysis./Cet article présente une étude d’exactitude du moteur ROC open source, Krakan, sur la revue académique arabe de premier rang, al-Abhath. Contrairement à d’autres moteurs ROC disponib...

Log In