Efficient Identification of Common Subsequences from Big Data Streams Using Sliding Window Technique

Adi Alhudhaif

doi:10.14569/IJACSA.2014.051106

Back

Efficient Identification of Common Subsequences from Big Data Streams Using Sliding Window Technique

Journal article

Open access

Efficient Identification of Common Subsequences from Big Data Streams Using Sliding Window Technique

Adi Alhudhaif

International journal of advanced computer science & applications, Vol.5(11), pp.29-32

01/01/2014

DOI: https://doi.org/10.14569/IJACSA.2014.051106

Abstract

Computer Science

Computer Science, Theory & Methods

Science & Technology

Technology

We propose an efficient Frequent Sequence Stream algorithm for identifying the top k most frequent subsequences over big data streams. Our Sequence Stream algorithm gains its efficiency by its time complexity of linear time and very limited space complexity. With a pre-specified subsequence window size S and the k value, in very high probabilities, the Sequence Stream algorithm retrieve the top k most frequent subsequences of size S. The Stream Sequence algorithm also provides a high accuracy of the estimation of the number of occurrences of each promoted subsequence. Our experiments indicate several factors that influence the result accuracy of the Sequence Stream algorithm: stream size, subsequence size S and frequency of the subsequence.

Files and links (1)

url

https://doi.org/10.14569/IJACSA.2014.051106View

Published (Version of record) Open

Metrics

1 Record Views

Details

Title: Efficient Identification of Common Subsequences from Big Data Streams Using Sliding Window Technique
Creators - without role: Adi Alhudhaif - George Washington University
Publication Details: International journal of advanced computer science & applications, Vol.5(11), pp.29-32
Publisher: Science & Information Sai Organization Ltd
Number of pages: 4
Identifiers: 9925834908331
Academic Unit: Prince Sattam Bin Abdulaziz University
Language: English
Resource Type: Journal article