Abstract: This paper introduces a new class of audio representations, together with a corresponding fast decomposition algorithm. The main feature of these representations is that they are both sparse and approximately shift-invariant, which allows similarity search to be performed in a sparse domain. The common sparse support of the similar patterns detected is then used to factorize their representations. The potential of this method for simultaneous structural analysis and compression tasks is illustrated by preliminary experiments on simple musical data.
https://hal-imt.archives-ouvertes.fr/hal-00696188 Contributor: Admin Télécom Paristech. Submitted on: Friday, May 11, 2012 - 10:54:53 AM. Last modification on: Sunday, June 26, 2022 - 1:16:57 PM. Long-term archiving on: Sunday, August 12, 2012 - 2:22:24 AM.
Manuel Moussallam, L. Daudet, Gael Richard. Audio Signal Representations for Factorization in the sparse domain. ICASSP, May 2011, Prague, Czech Republic. pp.513-516. ⟨hal-00696188⟩