Optimisation of HEVC motion estimation exploiting SAD and SSD GPU-based implementation

Randa Khemiri; Hassan Kibeya; Fatma Ezahra Sayadi; Nejmeddine Bahri; Mohamed Atri; Nouri Masmoudi

doi:10.1049/iet-ipr.2017.0474

Back

Optimisation of HEVC motion estimation exploiting SAD and SSD GPU-based implementation

Journal article

Open access

Peer reviewed

Optimisation of HEVC motion estimation exploiting SAD and SSD GPU-based implementation

Randa Khemiri, Hassan Kibeya, Fatma Ezahra Sayadi, Nejmeddine Bahri, Mohamed Atri and Nouri Masmoudi

IET image processing, Vol.12(2), pp.243-253

01/02/2018

DOI: https://doi.org/10.1049/iet-ipr.2017.0474

Abstract

Computer Science

Computer Science, Artificial Intelligence

Engineering

Engineering, Electrical & Electronic

Imaging Science & Photographic Technology

Science & Technology

Technology

The new High-Efficiency Video Coding (HEVC) standard doubles the video compression ratio compared to the previous H.264/AVC at the same video quality and without any degradation. However, this important performance is achieved by increasing the encoder computational complexity. That's why HEVC complexity is a crucial subject. The most time consuming and the most intensive computing part of HEVC is the motion estimation based principally on the sum of absolute differences (SAD) or the sum of square differences (SSD) algorithms. For these reasons, the authors proposed an implementation of these algorithms on a low cost NVIDIA GPU (graphics processing unit) using the Fermi architecture developed with Compute Unified Device Architecture language. The proposed algorithm is based on the parallel-difference and the parallel-reduction process. The investigational results show a significant speed-up in terms of execution time for most 64x64 pixel blocks. In fact, the proposed parallel algorithm permits a significant reduction in the execution time that reaches up to 56.17 and 30.4%, compared to the CPU, for SAD and SSD algorithms, respectively. This improvement proves that parallelising the algorithm with the new proposed reduction process for the Fermi-GPU generation leads to better results. These findings are based on a static study that determines the PU percentage utilisation for each dimension in the HEVC. This study shows that the larger PUs are the most utilised in temporal levels 3 and 4, which attain 84.56% for class E. This improvement is accompanied by an average peak signal-to-noise ratio loss of 0.095dB and a decrease of 0.64% in terms of BitRate.

Files and links (1)

url

https://doi.org/10.1049/iet-ipr.2017.0474View

Published (Version of record) Open

Metrics

1 Record Views

Details

Title: Optimisation of HEVC motion estimation exploiting SAD and SSD GPU-based implementation
Creators - without role: Randa Khemiri - University of Monastir
Hassan Kibeya - University of Sfax
Fatma Ezahra Sayadi - University of Monastir
Nejmeddine Bahri - University of Sfax
Mohamed Atri - University of Monastir
Nouri Masmoudi - University of Sfax
Publication Details: IET image processing, Vol.12(2), pp.243-253
Publisher: Inst Engineering Technology-Iet
Number of pages: 11
Identifiers: 9923165508331
Academic Unit: King Khalid University
Language: English
Resource Type: Journal article