Time-sensitive clustering evolving textual data streams

Mohamed Ammar; Adel Hidri; Minyar Sassi Hidri

doi:10.1504/IJCAT.2020.107900

Journal article

Peer reviewed

International journal of computer applications in technology, Vol.63(1-2), pp.25-40

01/01/2020

DOI: https://doi.org/10.1504/IJCAT.2020.107900

Abstract

Computer Science

Computer Science, Interdisciplinary Applications

Science & Technology

Technology

Clustering a stream of text documents is an emerging subject of interest, since it is widely used to analyse content in social media and e-journals. The aim is to find structure in unlabelled data based on a similarity criterion. However, few works have addressed this problem. This paper therefore proposes a new document clustering approach adapted to streams of text data and tests it on news article data sets. A distributed representation of words is used, and a bottom-up approach represents documents as vectors on a unit hypersphere. The proposed approach is rooted in the SPherical k-means (SPKM) algorithm and its underlying mixture of von Mises-Fisher (vMF) distributions. It yields results comparable to a baseline batch algorithm for stable data streams and superior results for rapidly evolving data streams.
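
As a rough illustration of the ingredients named in the abstract, the sketch below builds document vectors on the unit hypersphere by averaging word vectors, then runs plain batch spherical k-means with cosine similarity. It is not the authors' streaming, time-sensitive algorithm, whose details are not given in this record. The function names `doc_vector` and `spherical_kmeans`, the toy two-dimensional word vectors, and the choice of simple averaging are all assumptions made for the example.

```python
import numpy as np


def doc_vector(tokens, word_vectors):
    """Average the vectors of known tokens and project onto the unit sphere.

    Averaging is an assumed bottom-up composition; the paper may use another.
    """
    vecs = [word_vectors[t] for t in tokens if t in word_vectors]
    if not vecs:
        raise ValueError("document contains no known tokens")
    mean = np.mean(vecs, axis=0)
    norm = np.linalg.norm(mean)
    if norm == 0.0:
        raise ValueError("document vector has zero norm")
    return mean / norm


def spherical_kmeans(X, k, n_iter=20, seed=0):
    """Batch spherical k-means on unit-norm rows of X.

    Assignment uses cosine similarity (a dot product for unit vectors);
    each centroid is the re-normalised sum of its members.
    """
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct data points.
    centroids = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        labels = np.argmax(X @ centroids.T, axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) == 0:
                continue  # keep the previous centroid for an empty cluster
            total = members.sum(axis=0)
            norm = np.linalg.norm(total)
            if norm > 0.0:
                centroids[j] = total / norm
    labels = np.argmax(X @ centroids.T, axis=1)
    return labels, centroids


# Hypothetical toy word vectors: two loosely separated topics in 2-D.
word_vectors = {
    "match": np.array([1.0, 0.1]),
    "goal": np.array([0.9, 0.2]),
    "stock": np.array([0.1, 1.0]),
    "bank": np.array([0.2, 0.9]),
}
docs = [["match", "goal"], ["goal", "goal"], ["stock", "bank"], ["bank", "bank"]]
X = np.vstack([doc_vector(d, word_vectors) for d in docs])
labels, centroids = spherical_kmeans(X, k=2)
```

In a streaming setting, the centroid update would typically be applied incrementally as documents arrive, possibly with a decay on older documents; that part is omitted here because the record does not describe it.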

Details

Title: Time-sensitive clustering evolving textual data streams
Creators - without role: Mohamed Ammar - Supreme Council Of Health
Adel Hidri - Imam Abdulrahman Bin Faisal University
Minyar Sassi Hidri - Imam Abdulrahman Bin Faisal University
Publication Details: International journal of computer applications in technology, Vol.63(1-2), pp.25-40
Publisher: Inderscience Enterprises Ltd
Number of pages: 16
Identifiers: 9915789208331
Academic Unit: Imam Abdulrahman Bin Faisal University
Language: English
Resource Type: Journal article