FaceMD: convolutional neural network-based spatiotemporal fusion facial manipulation detection

Mohammed Aloraini

doi:10.1007/s11760-022-02227-x

Journal article

Peer reviewed

Signal, image and video processing, Vol.17(1), pp.247-255

01/02/2023

DOI: https://doi.org/10.1007/s11760-022-02227-x

Subjects: Computer Imaging; Computer Science; Image Processing and Computer Vision; Multimedia Information Systems; Original Paper; Pattern Recognition and Graphics; Signal, Image and Speech Processing; Vision

Abstract

Digital video has become essential to news broadcasts that reach audiences around the world, so it is important to ensure that broadcast video is reliable. Unfortunately, a digital video can be manipulated by replacing a person’s face or expressions with those of another person without leaving visible traces. Detecting this kind of facial manipulation is challenging because of the lack of digital forensic techniques that can verify the originality of video content. In this paper, we propose a novel approach, dubbed FaceMD, that fuses three streams of convolutional neural networks to detect facial manipulation. FaceMD incorporates spatiotemporal information by fusing video frames, motion residuals, and 3D gradients to improve detection accuracy. We combine the three streams using different fusion methods at different locations to make the best use of this spatiotemporal information and thereby increase detection performance. Experimental results show that FaceMD achieves state-of-the-art accuracy on two different facial manipulation data sets.
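
The three inputs named in the abstract (video frames, motion residuals, and 3D gradients) can be illustrated with a minimal NumPy sketch. This is not the paper's implementation. The abstract does not define these quantities, so the sketch assumes that a motion residual is a simple frame-to-frame difference, that the 3D gradient is the gradient magnitude over the time, height, and width axes, and that fusion is a weighted average of per-stream scores. The function names (motion_residuals, gradient_magnitude_3d, build_streams, late_fusion) are hypothetical, and the CNN streams themselves are omitted.

```python
import numpy as np


def motion_residuals(frames: np.ndarray) -> np.ndarray:
    """Assumed definition: difference between consecutive frames.

    frames has shape (T, H, W). The first residual is left at zero so the
    output keeps the same shape as the input.
    """
    frames = frames.astype(np.float32)
    residuals = np.zeros_like(frames)
    residuals[1:] = frames[1:] - frames[:-1]
    return residuals


def gradient_magnitude_3d(frames: np.ndarray) -> np.ndarray:
    """Assumed definition: gradient magnitude over (time, height, width)."""
    g_t, g_y, g_x = np.gradient(frames.astype(np.float32))
    return np.sqrt(g_t ** 2 + g_y ** 2 + g_x ** 2)


def build_streams(frames: np.ndarray) -> dict:
    """Prepare the three inputs that would feed three separate CNN streams."""
    return {
        "frames": frames.astype(np.float32),
        "motion_residuals": motion_residuals(frames),
        "gradients_3d": gradient_magnitude_3d(frames),
    }


def late_fusion(stream_scores, weights=None) -> float:
    """Assumed fusion rule: (weighted) average of per-stream scores.

    This is only one of many possible fusion methods; the abstract says
    several methods and fusion locations were compared.
    """
    scores = np.asarray(stream_scores, dtype=np.float32)
    return float(np.average(scores, weights=weights))


if __name__ == "__main__":
    # Synthetic grayscale clip: 8 frames of 32x32 pixels.
    clip = np.random.default_rng(0).random((8, 32, 32))
    streams = build_streams(clip)
    for name, data in streams.items():
        print(name, data.shape)
    # Placeholder per-stream manipulation scores, not real model outputs.
    print(late_fusion([0.2, 0.4, 0.9]))
```

Averaging scores at the output is the simplest case to show. Fusing intermediate feature maps inside the networks would need a deep learning framework and is left out here.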

Details

Title: FaceMD: convolutional neural network-based spatiotemporal fusion facial manipulation detection
Creators: Mohammed Aloraini (Qassim University)
Publication Details: Signal, image and video processing, Vol.17(1), pp.247-255
Publisher: Springer London
Identifiers: 9928270608331
Academic Unit: Qassim University
Language: English
Resource Type: Journal article