基于密度峰值的不确定性数据聚类算法

Journal of Jilin University (Information Science Edition) ›› 2026, Vol. 44 ›› Issue (2): 392-398.

Previous Articles Next Articles

Clustering Algorithm for Uncertain Data Based on Peak Density

LANG Jiayun, DING Xiaomei

School of Computer Engineering, Anhui Wenda Information Engineering College, Hefei 231201, China

Received:2024-05-26 Online:2026-04-14 Published:2026-04-14

Abstract

Abstract:

Due to the large scale of uncertain data and limited accuracy in clustering, the efficiency of data clustering is low. Therefore, a density peak based uncertain data clustering algorithm is proposed. Using the Mahalanobis distance method, interfering sample data is eliminated with low correlation, the missing values of uncertain data samples is calculated through entropy, and gradually the reverse restoration is performed. Using the density peak calculation method the distribution of cluster centers is determined. A decision graph is introduced to partition data clusters, the K-nearest neighbor idea is used to calculate the trust values of non cluster center data samples, secondary identification and partitioning of data points and noise with large trust value differences within clusters, optimizing the density peak clustering method. The experimental results show that when facing large-scale data, accurate clustering can still be achieved with less clustering time. The proposed method has high computational efficiency and has great significance for uncertain data mining and analysis.

Key words:

Markov distance method, Entropy value, decision diagram, peak density, K-nearest neighbor thought

CLC Number:

TP301. 6

LANG Jiayun, DING Xiaomei.

Clustering Algorithm for Uncertain Data Based on Peak Density

[J].Journal of Jilin University (Information Science Edition), 2026, 44(2): 392-398.

Clustering Algorithm for Uncertain Data Based on Peak Density

PDF (PC)

Like

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 0

Metrics

Comments

Recommended 10