-
Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison
- Author(s):
- Robert Forkel, Russell D. Gray, Simon J. Greenhill, Johann-Mattis List, Christoph Rzymski, Tiago Tresoldi (see profile)
- Date:
- 2019
- Group(s):
- Classical Philology and Linguistics, Digital Humanists, Linguistics
- Subject(s):
- Comparative linguistics, Historical linguistics, Research--Data processing
- Item Type:
- Book chapter
- Tag(s):
- computational historical linguistics, data managment, phylogenetics, Research data management
- Permanent URL:
- http://dx.doi.org/10.17613/pwva-kz72
- Abstract:
- THIS IS A PRE-PRINT, PLEASE CITE IT AS: Tresoldi, Tiago; Rzymski, Christoph; Forkel, Robert; Greenhill, Simon J.; List, Johann-Mattis; and Gray, Russell D. (2019) “Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison (PRE-PRINT)”. Jena: Max-Planck-Institute for the Science of Human History. The popularisation of computer-based methods in comparative linguistics has led to a greater awareness of issues resulting from limited data sustainability and proper data management. In this use-case and its accompanying tutorial, we present principles of data management as applied to computational phylogenetics and computer-assisted language comparison, showcasing the solutions we recommend. Instead of enumerating the many possibilities to code and use linguistic data to conduct a phylogenetic analysis, we illustrate our suggestions for phylogenetic data management in a workflow based on a concrete analysis, showing how data should be managed with the help of a published dataset, exploring the information, file formats, processes, and software involved, explaining and showing how to collect and store cross-linguistic information, how to guarantee that datasets are cross-linguistically comparable, how to store intermediate and final results of the analyses, and how to share data in a reusable form by relying in the tools and principles of the CLDF initiative.
- Notes:
- PLEASE NOTE THAT THIS IS A PRE-PRINT
- Metadata:
- xml
- Published as:
- Book chapter Show details
- Pub. Date:
- August 15th, 2019
- Chapter:
- Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison
- Status:
- Published
- Last Updated:
- 4 years ago
- License:
- All Rights Reserved
- Share this:
-
Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison