I ended up cleaning up the data I had by writing several batch (.bat) files to pull the songs out of thousands of folders. I also downloaded some MIDI file collections from Kaggle, Google Datasets, and various MIDI websites, but most of these either had few songs, were of low quality, or both.
I found a torrent of 130,000 MIDI files that formed the basis of my dataset.
Data collection was slightly insane, because I wanted massive amounts of MIDI files, but they had to be labeled by genre, and had to have some modicum of quality (MIDI files can be hit or miss).