基于多相似度模糊 C 均值聚类的不均衡流数据检索方法

Journal of Jilin University (Information Science Edition) ›› 2024, Vol. 42 ›› Issue (4): 726-732.

Previous Articles Next Articles

Data Retrieval Method of Unbalanced Streaming Based on Multi-Similarity Fuzzy C-Means Clustering

HAN Yunna

Basic Department, Modern College of Northwest University, Xi蒺an 710130, China

Received:2023-05-26 Online:2024-07-22 Published:2024-07-22

Abstract

Abstract: During the retrieval process of imbalanced stream data, the performance of data retrieval decreases due to the presence of imbalance in the data stream and the susceptibility to differential and edge data. In order to reduce the impact of the above factors, an imbalanced stream data retrieval method based on multi similarity fuzzy C-means clustering is proposed. This method calculates the multiple similarities between imbalanced flow data, and uses fuzzy C-means algorithm to cluster data with different similarities. By constructing a octree retrieval model, the data after clustering is stored, encoded and judged to complete the retrieval of unbalanced stream data. The experimental results show that the retrieval time of the proposed method is less than 20 seconds, and the recall and precision rates remain above 80% , with high NDCG( Normalized Discounted Cumulative Gain) values.

Key words: standard feature matrix, cross cluster, data encoding filter, unbalanced measure; 3D coordinates, judgment code

CLC Number:

TP393. 08

HAN Yunna. Data Retrieval Method of Unbalanced Streaming Based on Multi-Similarity Fuzzy C-Means Clustering[J].Journal of Jilin University (Information Science Edition), 2024, 42(4): 726-732.

Data Retrieval Method of Unbalanced Streaming Based on Multi-Similarity Fuzzy C-Means Clustering

PDF (PC)

Like

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 0

Metrics

Comments

Recommended 10