Search for a command to run...
This dataset contains eleven digitized books containing Ukrainian folk songs published in the twentieth and twenty-first centuries. For each printed volume, two structured CSV files are provided: 1. a file containing the full texts;2. a file containing metadata (when such information was available in the original edition). The corpus consists of eight collections of Ukrainian epic songs (dumas) and three collections of folk songs from the Podillia region, representing various genres. Each text file preserves the internal structure of the respective printed source. Where documented in the original publications, the metadata include genre, year and place of recording, collector, performer, and biographical details of the respondent (age or year of birth). The Podillia region materials are linguistically closer to standard Ukrainian due to editorial normalization in the source editions. The epic dumas preserve complex narrative structures, formulaic language, and performance-driven variation, and reflect multiple historical orthographic systems, including Russian-based orthography of the late 19th century, as well as several Ukrainian spelling norms (maksymovychivka, kulishivka, dragomanivka, and zhelehivka).