INFORMATION SCIENCES, vol. 575, pp. 454-467, 2021 (SCI-Expanded)
Clustering is a commonly used tool for data management and analysis. One prominent group of clustering methods consists of the density-based clustering algorithms. The use of fuzzy neighborhood functions in density-based clustering algorithms is known to significantly improve robustness, so that choosing neighborhood parameters becomes rather easy for the user. On the other hand, because of the overhead of the fuzzy calculations, these algorithms demand more computing resources. This study discusses how FN-DBSCAN, a fuzzy density-based clustering algorithm, can be implemented efficiently. A specialized FN-DBSCAN algorithm that adopts techniques used to improve classical density-based clustering algorithms is introduced. A parallel version of the algorithm is also proposed, and the implementation details of both are discussed. The proposed algorithms are tested in a set of comparative experiments, along with a straightforward FN-DBSCAN implementation and a curious but unsafe modification of the parallel algorithm. The experiments, conducted in a modest parallel computing environment of 32 processing units, show relative speed-ups that vary widely, ranging from 2 to 850 times. (c) 2021 Elsevier Inc. All rights reserved.
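For readers unfamiliar with the idea, the following is a minimal sketch of what a straightforward fuzzy neighborhood density-based clustering might look like. It is not the efficient or the parallel algorithm of the paper. The linear membership function, the quadratic-time neighbor search, and the names `linear_membership`, `fuzzy_cardinality`, `fn_dbscan`, `eps`, and `min_card` are all assumptions made for illustration; the abstract does not specify any of them.

```python
import math

def linear_membership(d, eps):
    """Assumed fuzzy membership: 1 at distance 0, falling linearly to 0 at eps."""
    return max(0.0, 1.0 - d / eps)

def fuzzy_cardinality(points, i, eps):
    """Sum of the memberships of all points in the fuzzy neighborhood of points[i]."""
    return sum(linear_membership(math.dist(points[i], q), eps) for q in points)

def fn_dbscan(points, eps, min_card):
    """Naive O(n^2) sketch: returns a cluster id per point, or -1 for noise."""
    n = len(points)
    labels = [None] * n
    cluster = 0
    for i in range(n):
        if labels[i] is not None:
            continue
        # A point is treated as a core point when its fuzzy cardinality
        # reaches the threshold (hypothetical parameter min_card).
        if fuzzy_cardinality(points, i, eps) < min_card:
            labels[i] = -1
            continue
        labels[i] = cluster
        stack = [i]
        while stack:
            p = stack.pop()
            if fuzzy_cardinality(points, p, eps) < min_card:
                continue  # border point: belongs to the cluster but is not expanded
            for j in range(n):
                reachable = linear_membership(math.dist(points[p], points[j]), eps) > 0
                if labels[j] in (None, -1) and reachable:
                    labels[j] = cluster
                    stack.append(j)
        cluster += 1
    return labels

points = [(0, 0), (0.1, 0), (0, 0.1), (5, 5), (5.1, 5), (5, 5.1), (10, 0)]
labels = fn_dbscan(points, eps=1.0, min_card=2.0)
```

In this sketch, each neighbor contributes a weight between 0 and 1 rather than a hard count of 1, which illustrates one way a fuzzy neighborhood can make the density estimate less sensitive to the exact radius. Recomputing the cardinality for every visited point is the kind of overhead that an efficient implementation would aim to avoid.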