An adaptive clustering algorithm by neighbourhood search for large-scale data


Sevinc B., Gürler S.

JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, cilt.93, sa.1, ss.175-187, 2023 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 93 Sayı: 1
  • Basım Tarihi: 2023
  • Doi Numarası: 10.1080/00949655.2022.2098298
  • Dergi Adı: JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Aerospace Database, Business Source Elite, Business Source Premier, CAB Abstracts, Communication Abstracts, Metadex, Veterinary Science Database, zbMATH, Civil Engineering Abstracts
  • Sayfa Sayıları: ss.175-187
  • Anahtar Kelimeler: Clustering algorithm, neighbourhood search, adaptive cluster sampling, DBSCAN
  • Dokuz Eylül Üniversitesi Adresli: Evet

Özet

Adaptive cluster sampling (ACS) is a sampling method relies on the neighbourhood search on a grid structure. It has an adaptive selection process of units and recursively added units reveal the batched individuals easily and quickly. In this paper, we propose a new clustering method called spatial adaptive clustering (SAC) based on the idea of ACS design. The SAC algorithm forms clusters based on neighbourhood search using grid structures and is able to detect noise points. The performance of the proposed algorithm is evaluated through comparison with the results from well-known density-based clustering approaches in the literature using real and artificial data sets. Computational results indicate that the proposed algorithm is effective in terms of external validation measures for clustering of arbitrary shaped data with noise. Additionally, the SAC algorithm is tested on artificial data sets of varying sizes for the runtime criterion. The results reveal that it also performs superbly for the objective of reducing the runtime.