Data clustering is a common challenge in big data processing, and parallelizing clustering operations can significantly improve efficiency in applications that involve frequent searches. Among the many clustering techniques available for grouping data, CBAR is widely used across different applications. Parallelizing CBAR is essential for big data, and the Hadoop MapReduce platform offers a suitable framework for doing so by leveraging effective segmentation techniques. This book designs and implements algorithms for CBAR using the MapReduce approach, with testing conducted on clusters of up to 4 nodes. The results demonstrate substantial performance gains, which are analyzed and discussed with illustrative examples.

Book details:

ISBN-13:

978-3-659-91275-7

ISBN-10:

3659912751

EAN:

9783659912757

Book language:

English

By (author):

Sayantan Singha Roy

Number of pages:

76

Published:

14.11.2024

Category:

Technology