0% Complete
Home
/
11th International Conference on Computer and Knowledge Engineering
Improvement of CluStream Algorithm Using Sliding Window for the Clustering of Data Streams
Authors :
Sahar Ahsani
1
Morteza Yousef Sanati
2
Muharram Mansoorizadeh
3
1- Buali-Sina University
2- Buali-Sina University
3- Buali-Sina University
Keywords :
data stream, clustering of data stream, window models, sliding window
Abstract :
Today, data are produced in large amounts, mostly in form of data streams. A data stream is an unlimited stream of data that is produced in large amounts and with high speeds. Therefore, it can be defined as a sequence of data objects in specified time intervals. One of the most common processes performed on data streams is clustering which is aimed at dividing the data items into homogeneous groups. A well-known clustering algorithm is Clustream, an implemented version of which has been developed for the distributed environment of Apache Spark. This algorithm makes use of a tilted window. The present paper offers a modified version of the algorithm which utilizes a sliding window for clustering. In the proposed method, only the latest data are used in updating the produced model and the old data are removed, which allows for a higher speed of execution and achieving more desirable results. The proposed algorithm was implemented in Apache Spark. The results of multiple executions of the proposed algorithm on authentic data and comparing them with the Clustream algorithm based on tilted window indicate that our algorithm performs much better in terms of precision.
Papers List
List of archived papers
Security Analysis of MiniApps: Vulnerabilities, Exploits, and a Tailored Mitigation Framework
Keyhan Mohammadi - Arman Moradi - Reza Ebrahimi Atani
GAP: Fault tolerance Improvement of Convolutional Neural Networks through GAN-aided Pruning
Pouya Hosseinzadeh - Yasser Sedaghat - Ahad Harati
Parallel Local Feature Selection For High-dimensional Data
Zhaleh Manbari - Chiman Salavati - Fardin AkhlaghianTab - Barzan Saeedpoor - Himan Delbina - Mahmud Abdulla Mohammad
Using Deep Learning for Classification of Lung Cancer on CT Images in Ardabil Province
Mohammad Ali Javadzadeh Barzaki - Jafar Abdollahi - Mohammad Negaresh - Maryam Salimi - Hadi Zolfeghari - Mohsen Mohammadi - Asma Salmani - Rona Jannati - Firouz Amani
Implementation of a Low-Overhead 2-Bit Parity-Preserving Reversible Vedic Multiplier for Quantum Architectures
Shekoofeh Moghimi - Negin Mashayekhi - Mohammad Reza Reshadinezhad
Uncertainty-Aware Deep Ensembles for Confident Customer Churn Prediction with Rejection Option
Fatemeh Moradi - Mehran Tarif - Mohammadhossein Homaei
Optimizing Text-Based Protocol Clustering in Reverse Engineering with Auto-Encoders and Fine-Tuned Parameters
Shiva Mahmoudzadeh - Mohaddese Nemati - Mehdi Teimouri
Identification of Botnets and Nodes Attacking Smart Cities by Majority Voting Mechanism and Feature Selection
Maliheh Araghchi - Nazbanoo Farzaneh
Classification of benign and malignant tumors in Digital Breast Tomosynthesis images using Radiomic-based methods
Farangis Sajadi moghadam - Saeid Rashidi
A 2D-CNN Architecture for Improving the Classification Accuracy of an Electronic Nose with Different Sensor Positions
Hannaneh Mahdavi - Reza Goldoust - Saeideh Rahbarpour
more
Samin Hamayesh - Version 43.7.0