0% Complete
Home
/
11th International Conference on Computer and Knowledge Engineering
Improvement of CluStream Algorithm Using Sliding Window for the Clustering of Data Streams
Authors :
Sahar Ahsani
1
Morteza Yousef Sanati
2
Muharram Mansoorizadeh
3
1- Buali-Sina University
2- Buali-Sina University
3- Buali-Sina University
Keywords :
data stream, clustering of data stream, window models, sliding window
Abstract :
Today, data are produced in large amounts, mostly in form of data streams. A data stream is an unlimited stream of data that is produced in large amounts and with high speeds. Therefore, it can be defined as a sequence of data objects in specified time intervals. One of the most common processes performed on data streams is clustering which is aimed at dividing the data items into homogeneous groups. A well-known clustering algorithm is Clustream, an implemented version of which has been developed for the distributed environment of Apache Spark. This algorithm makes use of a tilted window. The present paper offers a modified version of the algorithm which utilizes a sliding window for clustering. In the proposed method, only the latest data are used in updating the produced model and the old data are removed, which allows for a higher speed of execution and achieving more desirable results. The proposed algorithm was implemented in Apache Spark. The results of multiple executions of the proposed algorithm on authentic data and comparing them with the Clustream algorithm based on tilted window indicate that our algorithm performs much better in terms of precision.
Papers List
List of archived papers
An Improved and Accurate Measure for Mining Correlated High-utility Itemsets
Amir Masoud Heidari Orojloo - Morteza Keshtkaran
Virus-Antiviral Prediction Using Machine and Deep Learning Methods
Shayan Majidifar - Fatemeh Nasiri - Mohsen Hooshmand
Instance Selection from Skewed Class Distributions by Using the multi-objective optimizer
Mona Moradi - Javad Hamidzadeh
Efficient Sub-Carrier Relationship Extraction for Human Activity Recognition via EEGNet in Wireless Sensing
Siavash Zaravashan - Sadegh ArefiZadeh - Sajjad Torabi
A Novel Deformable Registration Method for Cerebral Magnetic Resonance Images
Bahareh Asadpour Dasht Bayaz - Mahdi Saadatmand - Fabrice Wallois
Experimental evaluation and comparison of anti-pattern detection tools by the gold standard
Somayeh Kalhor - Mohammad reza Keyvanpour - Afshin Salajegheh
DFIG-WECS Renewable Integration to the Grid and Stability Improvement through Optimal Damping Controller Design
Theophilus Ebuka Odoh - Aliyu Sabo - Hossien Shahinzadeh - Noor Izzri Abdul Wahab - Farshad Ebrahimi
Semi-automatic Detection of Persian Stopwords using FastText Library
Mohammad Dehghani - Mohammad Manthouri
Forecasting El Niño Six Months in Advance Utilizing Augmented Convolutional Neural Network
Mohammad Naisipour - Iraj Saeedpanah - Arash Adib - Mohammad Hossein Neisi Pour
Blind Load-Balancing Algorithm using Double-Q-learning in the Fog Environment
Niloofar Tahmasebi pouya - Mehdi Agha Sarram
more
Samin Hamayesh - Version 42.4.1