SSIG and IRISA at Multimodal Person Discovery - IMT - Institut Mines-Télécom Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

SSIG and IRISA at Multimodal Person Discovery

Résumé

This paper describes our approach and results in the multi-modal person discovery in broadcast TV task at MediaEval 2015. We investigate two distinct aspects of multimodal person discovery. One refers to face clusters, which are considered to propagate names associated to faces in one shot to other faces that probably belong to the same person. The face clustering approach consists in calculating face similarities using partial least squares (PLS) and a simple hierarchical approach. The other aspect refers to tag propagation in a graph-based approach where nodes are speaking faces and edges link similar faces/speakers. The advantage of the graph-based tag propagation is to not rely on face/speaker clustering, which we believe can be errorprone.
Fichier principal
Vignette du fichier
mediaeval.pdf (126.91 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01196171 , version 1 (09-09-2015)

Identifiants

  • HAL Id : hal-01196171 , version 1

Citer

Cassio Elias dos Santos Jr., Guillaume Gravier, William Robson Schwartz. SSIG and IRISA at Multimodal Person Discovery. Working Notes Proceedings of the MediaEval Workshop, 2015, Wurzen, Germany. ⟨hal-01196171⟩
369 Consultations
133 Téléchargements

Partager

Gmail Facebook X LinkedIn More