0% Complete
Home
/
11th International Conference on Computer and Knowledge Engineering
Improvement of CluStream Algorithm Using Sliding Window for the Clustering of Data Streams
Authors :
Sahar Ahsani
1
Morteza Yousef Sanati
2
Muharram Mansoorizadeh
3
1- Buali-Sina University
2- Buali-Sina University
3- Buali-Sina University
Keywords :
data stream, clustering of data stream, window models, sliding window
Abstract :
Today, data are produced in large amounts, mostly in form of data streams. A data stream is an unlimited stream of data that is produced in large amounts and with high speeds. Therefore, it can be defined as a sequence of data objects in specified time intervals. One of the most common processes performed on data streams is clustering which is aimed at dividing the data items into homogeneous groups. A well-known clustering algorithm is Clustream, an implemented version of which has been developed for the distributed environment of Apache Spark. This algorithm makes use of a tilted window. The present paper offers a modified version of the algorithm which utilizes a sliding window for clustering. In the proposed method, only the latest data are used in updating the produced model and the old data are removed, which allows for a higher speed of execution and achieving more desirable results. The proposed algorithm was implemented in Apache Spark. The results of multiple executions of the proposed algorithm on authentic data and comparing them with the Clustream algorithm based on tilted window indicate that our algorithm performs much better in terms of precision.
Papers List
List of archived papers
Multi Model CNN Based Gas Meter Characters Recognition
Sanaz Tarhib - Jafar Tanha - Soodabeh Imanzadeh - Sahar Hassanzadeh Mostafaei
Word-level Persian Lipreading Dataset
Javad Peymanfard - Ali Lashini - Samin Heydarian - Hossein Zeinali - Nasser Mozayani
Multi-Layer Collaborative Graph with BPR Similarity Embedding for Recommender System
Mostafa Ghorbani - Azadeh Mansouri
Optimizing Question-Answering Framework Through Integration of Text Summarization Model and Third-Generation Generative Pre-Trained Transformer
Ervin Gubin Moung - Toh Sin Tong - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Farashazillah Yahya
Bridging the Synthetic-to-Real Gap (BSRG): Creating Simulated Datasets for Domain Adaptation to Enhance Vehicle Detection
Behnaz Sadeghigol - Mohammad Ali Keyvanrad
Assessing Users' Influence on Respondents in Conversation Quality: A Quantitative Study on Reddit Based on the Cooperative Principle
Afsaneh Habibi - Fattaneh Taghiyareh
Compressing Deep Neural Networks Using Explainable AI
Kimia Soroush - Mohsen Raji - Behnam Ghavami
Virus-Antiviral Prediction Using Machine and Deep Learning Methods
Shayan Majidifar - Fatemeh Nasiri - Mohsen Hooshmand
An Exploratory Study of the Relationship between SATD and Other Software Development Activities
Shima Esfandiari - Ashkan Sami
Camouflage Object Segmentation with Attention-Guided Pix2Pix and Boundary Awareness
Erfan Akbarnezhad Sany - Fatemeh Naserizadeh - Parsa Sinichi - Seyyed Abed Hosseini
more
Samin Hamayesh - Version 41.3.1