-
Nilo Pedrazzini deposited Exploiting Cross-Dialectal Gold Syntax for Low-Resource Historical Languages: Towards a Generic Parser for Pre-Modern Slavic on Humanities Commons 2 years, 2 months ago
This paper explores the possibility of improving the performance of specialized parsers for pre-modern Slavic by training them on data from different related varieties. Because of their linguistic heterogeneity, pre-modern Slavic varieties are treated as low-resource historical languages, whereby cross-dialectal treebank data may be exploited to…
-
Nilo Pedrazzini deposited OldSlavNet: A scalable Early Slavic dependency parser trained on modern language data on Humanities Commons 2 years, 2 months ago
Historical languages are increasingly being modelled computationally. Syntactically annotated texts are often a sine-qua-non in their modelling, but parsing of pre-modern language varieties faces great data sparsity, intensified by high levels of orthographic variation. In this paper we present a good-quality Early Slavic dependency parser,…