-
OldSlavNet: A scalable Early Slavic dependency parser trained on modern language data,
- Author(s):
- Hanne Eckhoff, Nilo Pedrazzini (see profile)
- Date:
- 2021
- Subject(s):
- Natural language processing (Computer science)
- Item Type:
- Article
- Tag(s):
- computational historical linguistics, Dependency parsing, Early Slavic, Neural Networks, old church slavonic, old east slavic, Treebanks, Natural language processing
- Permanent URL:
- http://dx.doi.org/10.17613/5ebw-bj62
- Abstract:
- Historical languages are increasingly being modelled computationally. Syntactically annotated texts are often a sine-qua-non in their modelling, but parsing of pre-modern language varieties faces great data sparsity, intensified by high levels of orthographic variation. In this paper we present a good-quality Early Slavic dependency parser, attained via manipulation of modern Slavic data to resemble the orthography and morphosyntax of pre-modern varieties. The tool can be deployed to expand historical treebanks, which are crucial for data collection and quantification, and beneficial to downstream NLP tasks and historical text mining.
- Metadata:
- xml
- Published as:
- Journal article Show details
- Pub. DOI:
- https://doi.org/10.1016/j.simpa.2021.100063
- Publisher:
- Elsevier
- Pub. Date:
- 2021
- Journal:
- Software Impacts
- Volume:
- 8
- ISSN:
- 2665-9638
- Status:
- Published
- Last Updated:
- 2 years ago
- License:
- All Rights Reserved
- Share this:
-
OldSlavNet: A scalable Early Slavic dependency parser trained on modern language data,