Parsing the Dictionary of Modern Literary Russian Language with the Method of SCD Configurations. The Lexicographic Modeling

Authors: Neculai Curteanu, Svetlana Cojocaru, Eugenia Burcă
Keywords: new approach to dictionary entry parsing; the parsing method of SCD configurations; parsing the largest Romanian, German, French, and Russian dictionaries; lexicographic modeling.


This paper extends the experience gained from parsing five other of the largest, appreciably different Romanian, French, and German dictionaries to DMLRL (Dictionary of Modern Literary Russian Language) [18], using the optimal and portable parsing method of SCD (Segmentation-Cohesion-Dependency) configurations [7], [11], [15]. The purpose of the paper is to elaborate the lexicographic modeling of DMLRL, which necessarily precedes the sense-tree parsing of its dictionary entries. Three SCD configurations are described. The first separates the lexicographic segments of a DMLRL entry. The second concentrates on the SCD marker classes and their hypergraph hierarchy for the DMLRL primary and secondary senses. The third carries the same modeling process down to the atomic sense definitions and their examples-to-definitions. The dependency hypergraph of the third SCD configuration, interconnected with that of the second, is specified completely at the atomic sense level for the first time, going beyond the SCD configuration modeling of the five other dictionaries [15], [14]. Numerous examples from DMLRL and comparisons with the Romanian DLR-DAR thesaurus-dictionary support the proposed DMLRL lexicographic modeling.

Neculai Curteanu,
Institute of Computer Science,
Romanian Academy, Iaşi Branch,
Str. Gh. Asachi, Nr. 3, 700483 Iaşi, România
E-mail:

Svetlana Cojocaru, Eugenia Burcă,
Institute of Mathematics and Computer Science,
Academy of Sciences of Moldova,
Str. Academiei nr. 5, Chişinău, MD 2028, R. Moldova
E-mail:


