Audiovisual synchrony assessment for replay attack detection in talking face biometrics

Authors: Boutellaa Elhocine*; Boulkenafet Zinelabidine; Komulainen Jukka; Hadid Abdenour
Source: Multimedia Tools and Applications, 2016, 75(9): 5329-5343.
DOI: 10.1007/s11042-015-2848-2

Abstract

Audiovisual speech synchrony detection is an important liveness check for talking face verification systems, as it ensures that the input biometric samples are actually acquired from the same source. In prior work, the visual speech features used have mainly described facial appearance or mouth shape in a frame-wise manner, thus ignoring the lip motion between consecutive frames. Since visual speech dynamics are also important, we take the spatiotemporal information into account and propose the use of space-time auto-correlation of gradients (STACOG) for measuring audiovisual synchrony. To evaluate the effectiveness of the proposed approach, a set of challenging and realistic attack scenarios is designed by augmenting the publicly available BANCA and XM2VTS datasets with synthetic replay attacks. Our experimental analysis shows that the STACOG features outperform the state of the art, e.g. discrete cosine transform based features, in measuring audiovisual synchrony.

  • Publication date: 2016-5