Simulating the Impact of Annotation Guidelines and Annotated Data on Extracting App Features from App Reviews

Faiz Ali Shah; Kairit Sirts; Dietmar Pfahl

doi:10.5220/0007909703840396

Conference proceeding

ICSOFT: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, pp.384-396

01/01/2019

DOI: https://doi.org/10.5220/0007909703840396

Subjects: Computer Science; Computer Science, Software Engineering; Science & Technology; Technology

Abstract

The quality of automatic app feature extraction from app reviews depends on several aspects, such as the feature extraction method, the training and evaluation datasets, and the evaluation method. The annotation guidelines used to guide the annotation of training and evaluation datasets can have a considerable impact on the quality of the whole system, yet they are often overlooked. We conducted a study exploring the effects of annotation guidelines on the quality of app feature extraction. We propose several changes to the existing annotation guidelines with the goal of making the extracted app features more useful to app developers. We test the proposed changes by simulating the application of the new annotation guidelines and evaluating the performance of supervised machine learning models trained on datasets annotated with the initial and the simulated annotation guidelines. Although the overall performance of automatic app feature extraction remains the same as that of the model trained on the dataset with the initial annotations, the features extracted by the model trained on the dataset with the simulated new annotations are less noisy and more informative to app developers.

Details

Title: Simulating the Impact of Annotation Guidelines and Annotated Data on Extracting App Features from App Reviews
Creators - without role: Faiz Ali Shah - University of Tartu
Kairit Sirts - University of Tartu
Dietmar Pfahl - University of Tartu
Contributors - without role: M VanSinderen
L Maciaszek
Publication Details: ICSOFT: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, pp.384-396
Publisher: Scitepress
Number of pages: 13
Grant note: Estonian Center of Excellence in ICT research (EXCITE) IUT20-55 / Estonian Research Council
Identifiers: 9910919608331
Academic Unit: Taif University
Language: English
Resource Type: Conference proceeding