Abstract
•Exogenous variables integrated into service processes can be used within modern accounting and assurance operational services.•This study brings an innovative data source, social media information, to government accounting information systems as part of the service evaluation and assessment factor.•Social media information is analyzed using text mining techniques and machine learning algorithms.•The study presents an analytical approach to classify the tweets and uses VADER to derive public opinion about street cleanliness.
This study demonstrates a way of bringing an innovative data source, social media information, to the government accounting information systems to support accountability to stakeholders and managerial decision-making. Future accounting and auditing processes will heavily rely on multiple forms of exogenous data. As an example of the techniques that could be used to generate this needed information, the study applies text mining techniques and machine learning algorithms to Twitter data. The information is developed as an alternative performance measure for NYC street cleanliness. It utilizes Naïve Bayes, Random Forest, and XGBoost to classify the tweets, illustrates how to use the sampling method to solve the imbalanced class distribution issue, and uses VADER sentiment to derive the public opinion about street cleanliness. This study also extends the research to another social media platform, Facebook, and finds that the incremental value is different between the two social media platforms. This data can then be linked to government accounting information systems to evaluate costs and provide a better understanding of the efficiency and effectiveness of operations.