Automatic Instrument Identification in Polyphonic Music

Course project for DS-GA 1003 Machine Learning.


  • Jiyuan Qian
  • Tian Wang
  • Peter Li


MedleyDB, which is available from


We compared ConvNets trained on raw audio, MFCC, and CQT with traditional MIR methods that extract Gaussian features from MFCCs and their first- and second-order deltas. ConvNets trained on handcrafted features can outperform the traditional methods, and those trained on raw audio, though they take much longer to train, can achieve arguably better performance.
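
As a rough illustration of the traditional baseline's feature step, the sketch below summarises an MFCC matrix and its first- and second-order deltas into one fixed-length vector. It is not the project's actual code. It assumes that "Gaussian features" means the per-coefficient mean and standard deviation over time, it uses a plain finite difference (`np.gradient`) in place of whatever delta filter the project used, and the name `gaussian_summary` and the random input are hypothetical.

```python
import numpy as np


def gaussian_summary(mfcc):
    """Summarise an MFCC matrix of shape (n_coeffs, n_frames).

    Hypothetical helper: returns the per-row mean and standard deviation
    of the MFCCs and their first- and second-order deltas, concatenated
    into a vector of length 6 * n_coeffs.
    """
    mfcc = np.asarray(mfcc, dtype=float)
    # Assumption: deltas are approximated by finite differences along time.
    delta1 = np.gradient(mfcc, axis=1)
    delta2 = np.gradient(delta1, axis=1)
    stacked = np.vstack([mfcc, delta1, delta2])
    # Assumption: "Gaussian features" = mean and std of each row over time.
    return np.concatenate([stacked.mean(axis=1), stacked.std(axis=1)])


# Placeholder input: 13 coefficients over 200 frames of random data,
# standing in for MFCCs computed from a real audio clip.
rng = np.random.default_rng(0)
fake_mfcc = rng.normal(size=(13, 200))
features = gaussian_summary(fake_mfcc)
print(features.shape)  # (78,)
```

A vector like this could then be passed to any standard classifier, whereas the ConvNet variants take the raw audio or the full time-frequency representation as input instead.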