SSIG and IRISA at Multimodal Person Discovery

Cassio Elias dos Santos Jr.; Guillaume Gravier; William Robson Schwartz

Communication Dans Un Congrès Année : 2015

SSIG and IRISA at Multimodal Person Discovery

(1) , (2) , (1)

1
2

Cassio Elias dos Santos Jr.

Fonction : Auteur

Departamento de Ciência da Computação [Minas Gerais]

Guillaume Gravier

Fonction : Auteur
PersonId : 1046
IdHAL : guig
ORCID : 0000-0002-2266-5682
IdRef : 110355415

Creating and exploiting explicit links between multimedia fragments

William Robson Schwartz

Fonction : Auteur

Departamento de Ciência da Computação [Minas Gerais]

Résumé

This paper describes our approach and results in the multi-modal person discovery in broadcast TV task at MediaEval 2015. We investigate two distinct aspects of multimodal person discovery. One refers to face clusters, which are considered to propagate names associated to faces in one shot to other faces that probably belong to the same person. The face clustering approach consists in calculating face similarities using partial least squares (PLS) and a simple hierarchical approach. The other aspect refers to tag propagation in a graph-based approach where nodes are speaking faces and edges link similar faces/speakers. The advantage of the graph-based tag propagation is to not rely on face/speaker clustering, which we believe can be errorprone.

Domaines

Multimédia [cs.MM] Son [cs.SD] Traitement du signal et de l'image [eess.SP] Traitement du texte et du document Vision par ordinateur et reconnaissance de formes [cs.CV] Apprentissage [cs.LG]

Fichier principal

mediaeval.pdf (126.91 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Guillaume Gravier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01196171

Soumis le : mercredi 9 septembre 2015-11:37:04

Dernière modification le : mercredi 9 août 2023-09:40:15

Archivage à long terme le : lundi 28 décembre 2015-23:09:54

Dates et versions

hal-01196171 , version 1 (09-09-2015)

Identifiants

HAL Id : hal-01196171 , version 1

Citer

Cassio Elias dos Santos Jr., Guillaume Gravier, William Robson Schwartz. SSIG and IRISA at Multimodal Person Discovery. Working Notes Proceedings of the MediaEval Workshop, 2015, Wurzen, Germany. ⟨hal-01196171⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA CENTRALESUPELEC IRISA-D6 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

369 Consultations

133 Téléchargements

SSIG and IRISA at Multimodal Person Discovery

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager