Computer Science Journal of Moldova (CSJM), v.25, n.2 (74), 2017

On Digitization of Romanian Cyrillic Printings of the 17th-18th Centuries

Authors: Cojocaru Svetlana, Colesnicov Alexandru, Malahov Ludmila, Tudor Bumbu, Stefan Ungur

Abstract

The paper describes in detail the recognition of Romanian texts of the 17th–18th centuries printed in the Cyrillic script and their conversion to the modern Latin script. The challenges are discussed, and solutions to the problems are proposed. The developed technology and tool pack include historical alphabets, sets of recognition patterns, and spelling dictionaries in the corresponding orthographies for ABBYY FineReader. In addition, virtual keyboards, fonts, a transliteration utility, and a user manual were developed. Together these permit successful recognition of old Romanian texts in the Cyrillic script. Transliteration to the Latin script gives barrier-free access to historical documents.

Institute of Mathematics and Computer Science
Str. Academiei 5,
Chişinău, MD-2028,
Moldova
Phone: +373 22 72 59 82


