Multiple-Instrument Polyphonic Music Transcription using a Convolutive Probabilistic Model

Publication Type:

Conference Paper


SMC Conference 2011 (2011)


In this paper, a method for automatic transcription of music signals using a convolutive probabilistic model is proposed. The model extends the shift-invariant Probabilistic Latent Component Analysis method. Several note templates from multiple orchestral instruments are extracted from monophonic recordings and are used for training the transcription system. By incorporating shift-invariance into the model along with the constant-Q transform as a time-frequency representation, tuning changes and frequency modulations such as vibrato can be supported by the system. For postprocessing, Hidden Markov Models trained on MIDI data are employed, in order to favour temporal continuity. The system was tested on classical and jazz recordings from the RWC database, on recordings from a Disklavier piano, and a woodwind quintet recording. The proposed method, which can also be used for pitch content visualization, is shown to outperform several state-of-the-art approaches, using a variety of error metrics.

smc2011_submission_51.pdf142.97 KB
SMC paper: