Sparse Regression in Time-Frequency Representations of Complex Audio

Publication Type:

Conference Paper

Source:

SMC Conference 2010 (2010)

URL:

files/proceedings/2010/16.pdf

Abstract:

Time-frequency representations are commonly used tools for the representation of audio and in particular music sig- nals. From a theoretical point of view, these representa- tions are linked to Gabor frames. Frame theory yields a convenient reconstruction method making post-processing unnecessary. Furthermore, using dual or tight frames in the reconstruction, we may resynthesize localized components from so-called sparse representation coefficients. Sparsity of coefficients is directly reinforced by the application of a l1-penalization term on the coefficients. We introduce an iterative algorithm leading to sparse coefficients and demonstrate the effect of using these coefficients in sev- eral examples. In particular, we are interested in the ability of a sparsity promoting approach to the task of separating components with overlapping analysis coefficients in the time-frequency domain. We also apply our approach to the problem of auditory scene description, i.e. source identifi- cation in a complex audio mixture.