Discovering Clusters of Arbitrary Shapes and Densities in Data Streams

Discovering Clusters of Arbitrary Shapes and Densities in Data Streams

A density-based and grid-based approach to discover clusters in data streams

LAP Lambert Academic Publishing ( 2011-11-03 )

€ 49,00

Buy at the MoreBooks! Shop

The huge size of a continuously flowing data has put forward a number of challenges in data stream analysis. Exploration of the structure of streamed data represented a major challenge that resulted in introducing various clustering algorithms. However, current clustering algorithms still lack the ability to efficiently discover clusters of arbitrary densities in data streams. In this thesis, a new grid-based and density-based algorithm is proposed for clustering data streams. It addresses drawbacks of recent algorithms in discovering clusters of arbitrary densities. The algorithm uses an online component to map the input data to grid cells. An offline component is then used to cluster the grid cells based on density information. Relative density relatedness measures and a dynamic range neighborhood are proposed to differentiate clusters of arbitrary densities. The experimental evaluation shows considerable improvements upon the state-of-the-art algorithms in both clustering quality and scalability with different stream sizes and with higher dimensions. In addition, the output quality of the proposed algorithm is less sensitive to parameter selection errors.

Book Details:

ISBN-13:

978-3-8465-2434-3

ISBN-10:

3846524344

EAN:

9783846524343

Book language:

English

By (author) :

Amr Magdy
Nagwa M. El-Makky
Noha A. Yousri

Number of pages:

116

Published on:

2011-11-03

Category:

Informatics