0% Complete
Home
/
11th International Conference on Computer and Knowledge Engineering
Improvement of CluStream Algorithm Using Sliding Window for the Clustering of Data Streams
Authors :
Sahar Ahsani
1
Morteza Yousef Sanati
2
Muharram Mansoorizadeh
3
1- Buali-Sina University
2- Buali-Sina University
3- Buali-Sina University
Keywords :
data stream, clustering of data stream, window models, sliding window
Abstract :
Today, data are produced in large amounts, mostly in form of data streams. A data stream is an unlimited stream of data that is produced in large amounts and with high speeds. Therefore, it can be defined as a sequence of data objects in specified time intervals. One of the most common processes performed on data streams is clustering which is aimed at dividing the data items into homogeneous groups. A well-known clustering algorithm is Clustream, an implemented version of which has been developed for the distributed environment of Apache Spark. This algorithm makes use of a tilted window. The present paper offers a modified version of the algorithm which utilizes a sliding window for clustering. In the proposed method, only the latest data are used in updating the produced model and the old data are removed, which allows for a higher speed of execution and achieving more desirable results. The proposed algorithm was implemented in Apache Spark. The results of multiple executions of the proposed algorithm on authentic data and comparing them with the Clustream algorithm based on tilted window indicate that our algorithm performs much better in terms of precision.
Papers List
List of archived papers
Joint ADC-less Analog Demodulator and Decoder for Extended Binary (8, 4, 4) Hamming Channel Code
Mir Mahdi Safari - Jafar Pourrostam - Behzad Mozaffari Tazehkand
Cluster Sampling: A Cluster-Driven Sampling Strategy for Deep Metric Learning
Hamideh Rafiee - Ahmad Ali Abin - Seyed Soroush Majd
A Comprehensive Approach to SMS Spam Filtering Integrating Embedded and Statistical Features
Shaghayegh Hosseinpour - Mohammad Reza Keyvanpour
Enhancing Persian Word Sense Disambiguation with Large Language Models: Techniques and Applications
Fatemeh Zahra Arshia - Saeedeh Sadat Sadidpour
City Intersection Clustering and Analysis Based on Traffic Time Series
Mohammad Aminazadeh - Fakhroddin Noorbehbahani
Hybrid navigation based on GPS data and SIFT-based place recognition using Biologically-inspired SLAM
Sahar Salimpour Kasebi - Hadi Seyedarabi - Javad Musevi Niya
Histopathology Image-Based Cancer Classification Utilizing Transfer Learning Approach
Amir Meydani - Alireza Meidani - Ali Ramezani - Maryam Shabani - Mohammad Mehdi Kazeminasab - Shahriar Shahablavasani
Degarbayan-SC: A Colloquial Paraphrase Farsi Subtitles Dataset
Mohammad Javad Aghajani - Mohammad Ali Keyvanrad
Efficient Vision Transformer for Accurate Traffic Sign Detection
Javad Mirzapour Kaleybar - Hooman Khaloo - Avaz Naghipour
CSI-Based Human Activity Recognition using Convolutional Neural Networks
Parisa Fard Moshiri - Mohammad Nabati - Reza Shahbazian - Seyed Ali Ghorashi
more
Samin Hamayesh - Version 41.5.3