Handwritten Text Recognition of Ukrainian Manuscripts in the 21st Century: Possibilities, Challenges, and the Future of the First Generic AI-based Model

dc.contributor.authorTikhonov, Aleksejen_US
dc.contributor.authorRabus, Achimen_US
dc.date.accessioned2025-02-24T18:39:08Z
dc.date.available2025-02-24T18:39:08Z
dc.date.issued2024
dc.description.abstractThis article reports on developing and evaluating a generic Handwritten Text Recognition (HTR) model created for the automatic computer-assisted transcription of Ukrainian handwriting publicly available via the HTR platform Transkribus. The model’s training process encompasses diverse datasets, including historical manuscripts by renowned poets Taras Shevchenko and Lesya Ukrainka, along with private correspondence used for the General Regionally Annotated Corpus of Ukrainian (GRAC) and a diary procured at the Holodomor Museum collection. We evaluate the model’s performance by comparing its theoretical accuracy, with a character error rate (CER) of 4.2%, against its practical efficacy when augmented with an AI-based language model for Ukrainian and a Large Language Model. The model is versatile and functional and can thus be applied for mass-digitization of Ukrainian cultural heritage. In our outlook section, we identify possibilities for further improving the model.en_US
dc.identifier.citationTikhonov A. Handwritten Text Recognition of Ukrainian Manuscripts in the 21st Century: Possibilities, Challenges, and the Future of the First Generic AI-based Model / Aleksej Tikhonov, Achim Rabus // Kyiv-Mohyla Humanities Journal. - 2024. - No. 11: Kant Studies in Ukraine: History and Modernity (to Kant's 300th Anniversary). - P. 226-247. - https://doi.org/10.18523/2313-4895.11.2024.226-247en_US
dc.identifier.issn2313-4895
dc.identifier.urihttps://ekmair.ukma.edu.ua/handle/123456789/33697
dc.identifier.urihttps://doi.org/10.18523/2313-4895.11.2024.226-247
dc.language.isoen_USen_US
dc.relation.sourceKyiv-Mohyla Humanities Journalen_US
dc.statusfirst publisheden_US
dc.subjectUkrainianen_US
dc.subjecthandwritten text recognitionen_US
dc.subjectmanuscriptsen_US
dc.subjecthandwritingen_US
dc.subjectAIen_US
dc.titleHandwritten Text Recognition of Ukrainian Manuscripts in the 21st Century: Possibilities, Challenges, and the Future of the First Generic AI-based Modelen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Tikhonov_Handwritten_Text_Recognition_of_Ukrainian_Manuscripts_in_the_21st_Century_Possibilities_Challenges_and_the_Future_of_the_First_Generic_AI-based_Model
Size:
9.21 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: