Lessons from the Classroom: MEI for Data Scientists
Document Type
Journal Article
Role
Author
Journal Title
Journal of the Text Encoding Initiative
Issue
18
Publication Date
2024
Abstract
Many data science and computer science students today are familiar with JSON, and may even have worked with APIs to extract data from the web. Ask about XML, however, let alone TEI or MEI, and you are often met with quizzical looks. Yet XML files contain much information that can be productively analyzed with modern data science tools, so training students to leverage these materials is a worthwhile endeavor. The article shows some of the methods we use to help students understand XML as a hierarchical network of elements, how to traverse this network in search of relevant data, and how to harvest XML elements and attributes as tabular data for further analysis. It also reflects on some of the larger lessons learned through all of this work, as students were encouraged to consider the implications of representing the same knowledge in different ways, or what is gained or lost in the transformation of that knowledge from one representation to another.
Repository Citation
Freedman, R., & Russo-Batterham, D. (2024). Lessons from the Classroom: MEI for Data Scientists. Journal of the Text Encoding Initiative, Issue 18. https://doi.org/10.4000/13e5b
