Lessons from the Classroom: MEI for Data Scientists

Document Type

Journal Article

Role

Author

Journal Title

Journal of the Text Encoding Initiative

Issue

18

Publication Date

2024

Abstract

Many data science and computer science students today are familiar with JSON, and may even have worked with APIs to extract data from the web. Ask about XML, however, let alone TEI or MEI, and you are often met with quizzical looks. Yet XML files contain much information that can be productively analyzed with modern data science tools, so training students to leverage these materials is a worthwhile endeavor. The article shows some of the methods we use to help students understand XML as a hierarchical network of elements, how to traverse this network in search of relevant data, and how to harvest XML elements and attributes as tabular data for further analysis. It also reflects on some of the larger lessons learned through all of this work, as students were encouraged to consider the implications of representing the same knowledge in different ways, or what is gained or lost in the transformation of that knowledge from one representation to another.

Share

COinS