Dilated CNN for audio-to-score alignment

Unlike the original paper, which trains on the MSMD dataset, this model is trained on the ASAP dataset with synthetic structural augmentations.
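The description does not say which structural augmentations are applied. As a rough illustration only, the sketch below assumes they resemble repeated or skipped sections, which is a common kind of structural mismatch between a score and a performance. The function name `structural_augment`, the `boundaries` argument, and the repeat/skip choice are hypothetical and are not taken from this repository.

```python
import random


def structural_augment(frames, boundaries, rng):
    """Repeat or skip one section of a frame sequence (illustrative only).

    frames:     sequence of per-frame features (any indexable sequence)
    boundaries: sorted frame indices splitting `frames` into sections,
                e.g. [0, 4, 8] gives sections [0:4] and [4:8]
    rng:        a random.Random instance, for reproducibility

    Returns the augmented frames, the source index of every output frame
    (usable as alignment ground truth), and the operation applied.
    """
    sections = [
        list(range(boundaries[i], boundaries[i + 1]))
        for i in range(len(boundaries) - 1)
    ]
    target = rng.randrange(len(sections))
    # Skipping is only allowed when at least one other section remains.
    ops = ["repeat", "skip"] if len(sections) > 1 else ["repeat"]
    op = rng.choice(ops)

    index_map = []
    for i, section in enumerate(sections):
        if i == target and op == "skip":
            continue  # drop this section entirely
        index_map.extend(section)
        if i == target and op == "repeat":
            index_map.extend(section)  # play the section a second time

    augmented = [frames[j] for j in index_map]
    return augmented, index_map, op


if __name__ == "__main__":
    frames = list("abcdefgh")  # stand-in for real feature frames
    augmented, index_map, op = structural_augment(
        frames, [0, 4, 8], random.Random(0)
    )
    print(op, index_map, "".join(augmented))
```

Returning the index map alongside the augmented sequence keeps the correspondence to the unmodified sequence, so a synthetic example of this kind still has a known alignment target.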

realfolkcode/dcnn-alignment