new_dataset.txt - Contains list of all track Echo Nest ID. The format is: track idsong idartist namesong title
mxm_dataset_train.txt - Contains a list of songs ids with the occurence of a specified list of words in the lyrics. <Link - http://millionsongdataset.com/sites/default/files/AdditionalFiles/mxm_dataset_train.txt.zip>
mxm_779k_matches.txt - Contains list of songs and their artist defined by a sepecific id. <Link - http://millionsongdataset.com/sites/default/files/AdditionalFiles/mxm_779k_matches.txt.zip>
Note: I have created yug_dataset.txt and yug_test.txt for testing and debugging purposes with a small dataset.