Synthetic voice forgery in the forensic context: a short tutorial

Abstract : Technical voice forgery in the forensic area has led to several studies, mainly dealing with voice conversion. In the last decade, latests developments around voice synthesis have reached satisfactory intelligibility and quality levels. Moreover, several web-based or standalone apps can be used for TTS. Nowadays, HMM-based synthetic voices can be built to fool biometric systems. Several authors reported FARs as high as 70 to 80% when TTS voices where used. Nevertheless, the forensic context is quite different since the human ear might be able to detect a synthetic voice, thus leading to a case discarding. We used the MARY TTS platform in order to produce a speaker-dependent synthetic target voice sample. Given a single synthetic trial, our very preliminary work is to show how feasible and successful such an attack is. Further work is needed to build other voices and provide accurate statistics. Our aim is to confirm a criminal opportunity and to show that synthetic voice detection might become mandatory in a near future.
Complete list of metadatas

Cited literature [18 references]  Display  Hide  Download

https://hal-imt.archives-ouvertes.fr/hal-00625918
Contributor : Admin Télécom Paristech <>
Submitted on : Thursday, September 22, 2011 - 11:39:40 PM
Last modification on : Wednesday, February 20, 2019 - 2:38:35 PM
Long-term archiving on : Sunday, December 4, 2016 - 10:13:25 AM

File

FSAAWG2011.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00625918, version 1

Collections

Citation

Guillaume Galou, Gérard Chollet. Synthetic voice forgery in the forensic context: a short tutorial. Forensic Speech and Audio Analysis Working Group (ENFSI-FSAAWG), 2011, Rome, Italy. ⟨hal-00625918⟩

Share

Metrics

Record views

1627

Files downloads

436