11th International Conference on Computer and Knowledge Engineering
Improvement of CluStream Algorithm Using Sliding Window for the Clustering of Data Streams
Authors :
Sahar Ahsani (1), Morteza Yousef Sanati (2), Muharram Mansoorizadeh (3)
1- Buali-Sina University
2- Buali-Sina University
3- Buali-Sina University
Keywords :
data stream, clustering of data stream, window models, sliding window
Abstract :
Today, data are produced in large volumes, mostly in the form of data streams. A data stream is an unbounded sequence of data objects that arrive at high speed over specified time intervals. One of the most common operations on data streams is clustering, which aims to divide the data items into homogeneous groups. A well-known stream clustering algorithm is CluStream, an implementation of which has been developed for the distributed environment of Apache Spark. This algorithm uses a tilted window. The present paper proposes a modified version of the algorithm that uses a sliding window for clustering instead. In the proposed method, only the latest data are used to update the model and old data are discarded, which allows faster execution and yields more desirable results. The proposed algorithm was implemented in Apache Spark. Results from multiple runs of the proposed algorithm on real data, compared with the tilted-window CluStream algorithm, indicate that the proposed algorithm performs much better in terms of precision.
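The sliding-window idea in the abstract (update the model with the latest data only and drop old data) can be illustrated with a small sketch. This is not the authors' Spark implementation. It is a single-machine toy in Python that assumes a fixed set of seed centres, a count-based window, and a reduced cluster summary (count and linear sum per cluster). The class name `SlidingWindowClusterer` and its parameters are hypothetical.

```python
from collections import deque
import math


class SlidingWindowClusterer:
    """Toy clusterer whose summaries cover only the last `window_size` points."""

    def __init__(self, centers, window_size):
        self.window_size = window_size
        self.dim = len(centers[0])
        # Per-cluster summary: point count and per-dimension linear sum.
        self.counts = [0] * len(centers)
        self.sums = [[0.0] * self.dim for _ in centers]
        # Seed centres are used while a cluster holds no points (assumption).
        self.seeds = [list(c) for c in centers]
        # Points currently in the window, oldest first, with their cluster index.
        self.window = deque()

    def center(self, k):
        """Current centre of cluster k: mean of its in-window points, or its seed."""
        if self.counts[k] == 0:
            return self.seeds[k]
        return [s / self.counts[k] for s in self.sums[k]]

    def _nearest(self, point):
        return min(range(len(self.counts)),
                   key=lambda k: math.dist(point, self.center(k)))

    def add(self, point):
        """Absorb a new point, then evict the oldest one if the window is full."""
        k = self._nearest(point)
        self.counts[k] += 1
        for d, x in enumerate(point):
            self.sums[k][d] += x
        self.window.append((tuple(point), k))

        if len(self.window) > self.window_size:
            old, j = self.window.popleft()
            # Subtract the expired point so it no longer influences the model.
            self.counts[j] -= 1
            for d, x in enumerate(old):
                self.sums[j][d] -= x
        return k
```

Because the summaries are additive, removing an expired point is a subtraction rather than a recomputation over the whole window. A tilted window, by contrast, retains older data at coarser granularity instead of discarding it.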