Abstract
•Reliable authentication of the genuine audio to grant access.•Detection of splicing using audio(s) of the same environment and microphone.•Capability to detect splicing when environments and microphones are different.•Potential deployment in IoT platforms due to high accuracy and reliability.
The transmission of audio data via the Internet of Things makes such data vulnerable to tampering. Moreover, the availability of sophisticated tampering tools has allowed mobsters to change the context of audio data by altering their segments. Tampered audio may result in unpleasant situations for any member of society. To avoid such circumstances, a new audio forgery detection system is proposed in this study. This system can be deployed in edge devices to identify impostors and tampering in audio data. The proposed system is implemented using state-of-the-art mel-frequency cepstral coefficient features. Meanwhile, a Gaussian mixture model is used to train and validate the system. To evaluate the proposed system, a dataset of tampered audios is created by mixing recordings from two different speakers. The performance of the proposed system in authenticating genuine audio is between 92.50% and 100%, and that in detecting forged audio is between 99.90 and 100%.
[Display omitted]