A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting

Sébastien Fenet; Gael Richard; Yves Grenier

Communication Dans Un Congrès Année : 2011

A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting

(1) , (1) , (1)

Sébastien Fenet

Fonction : Auteur

Laboratoire Traitement et Communication de l'Information

Gael Richard

Fonction : Auteur
PersonId : 14146
IdHAL : gael-richard
IdRef : 094977208

Laboratoire Traitement et Communication de l'Information

Yves Grenier

Fonction : Auteur
PersonId : 742871
IdHAL : yves-grenier
IdRef : 087396513

Laboratoire Traitement et Communication de l'Information

Résumé

Audio fingerprint techniques should be robust to a variety of distortions due to noisy transmission channels or specific sound processing. Although most of nowadays techniques are robust to the majority of them, the quasi-systematic use of a spectral representation makes them possibly sensitive to pitch-shifting. This distortion indeed induces a modification of the spectral content of the signal. In this paper, we propose a novel fingerprint technique, relying on a hashing technique coupled with a CQT-based fingerprint, with a strong robustness to pitch-shifting. Furthermore, we have associated this method with an efficient post-processing for the removal of false alarms. We also present the adaptation of a database pruning technique to our specific context. We have evaluated our approach on a real-life broadcast monitoring scenario. The analyzed data consisted of 120 hours of real radio broadcast (thus containing all the distortions that would be found in an industrial context). The reference database consisted of 30.000 songs. Our method, thanks to its increased robustness to pitch-shifting, shows an excellent detection score.

Domaines

Acoustique [physics.class-ph] Acoustique [physics.class-ph]

Admin Télécom Paristech : Connectez-vous pour contacter le contributeur

https://imt.hal.science/hal-00657657

Soumis le : dimanche 8 janvier 2012-13:18:56

Dernière modification le : lundi 9 octobre 2023-12:49:40

Dates et versions

hal-00657657 , version 1 (08-01-2012)

Identifiants

HAL Id : hal-00657657 , version 1

Citer

Sébastien Fenet, Gael Richard, Yves Grenier. A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting. ISMIR, Oct 2011, Miami, United States. pp.121-126. ⟨hal-00657657⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM CNRS PARISTECH LTCI IDS S2A

166 Consultations

0 Téléchargements

A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager