J4 ›› 2010, Vol. 40 ›› Issue (5): 1211-1216.

• 地球探测与信息技术 • 上一篇    下一篇

基于核函数理论的系统聚类分析

陈永良|李学斌   

  1. 吉林大学 |综合信息矿产预测研究所|长春 130026
  • 收稿日期:2010-01-08 出版日期:2010-09-26 发布日期:2010-09-26
  • 作者简介:陈永良(1965-)|男|辽宁朝阳人|教授|主要从事矿产资源评价、数学地质方法、遥感图像处理和GIS应用等方面的研究|E-mail:chenyongliang2009@hotmail.com
  • 基金资助:

    国家自然科学基金项目(40872193)

Kernel-Based Hierarchical Cluster Analysis

 CHEN Yong-liang, LI Xue-bin   

  1. Mineral Resources Prediction Institute of Comprehensive Information, Jilin University, Changchun 130026,China
  • Received:2010-01-08 Online:2010-09-26 Published:2010-09-26

摘要:

为了完善系统聚类分析算法理论,使之具有区分数据集非线性集群特征的能力,将核函数理论和系统聚类分析算法有机结合,推导出基于核函数理论的系统聚类分析方法。其基本思路是:把样本从低维观测空间非线性变换至高维像空间,使样本变得线性可分;然后,应用核函数理论“隐式”地实现高维像空间的系统聚类分析。用Pb、Bi、Mo质量浓度作为化探异常的分类依据,对8处化探异常进行分类实验研究,在Pb、Bi、Mo两两组合的二维平面图中,8处化探异常明显地分为(1, 3, 8),(2, 4)和(5, 6, 7)3个点群,用核系统聚类方法能够很好地区分出这3个点群;而传统系统聚类方法却把8处化探异常错分成(1, 3, 8, 6)和(2, 4, 5, 7)两个类。由此可见,核系统聚类方法的类群区分能力高于传统系统聚类方法。

关键词: 核函数, 聚类分析, 观测空间, 像空间

Abstract:

In order to improve the algorithmic theory and make the hierarchical cluster analysis be able to find nonlinear clusters in a data set, the authors develop a kernel-based hierarchical cluster analysis method by integrating kernel functions with the hierarchical cluster analysis algorithm. The procedure of the new cluster algorithm can be described as follows. The input samples in the low-dimensional input space are nonlinearly mapped to a high-dimensional image space where the samples are linearly separable, and then a kernel function is applied to implicitly execute the hierarchical cluster analysis in the image space. The authors conduct an experiment on the unsupervised classification of eight geochemical anomalies according to the contents of Pb, Bi, and Mo. The eight geochemical anomalies are obviously divided into the three clusters, (1, 3, 8), (2, 4), and (5, 6, 7), according to the three pair-wised scatter plots derived from the contents of Pb, Bi, and Mo. Kernel hierarchical cluster analysis is able to properly differentiate these three clusters while the conventional hierarchical cluster analysis improperly classifies the eight geochemical anomalies into the two classes, (1, 3, 8, 6) and (2, 4, 5,7). Therefore, the clustering ability of the new method exceeds that of the conventional hierarchical cluster algorithm.

Key words: kernel functions, cluster analysis, observing space, image space

中图分类号: 

  • P628.1
[1] 陈欢庆, 穆剑东, 王珏, 邓西里. 扇三角洲沉积储层特征与定量评价——以辽河西部凹陷某试验区于楼油层为例[J]. 吉林大学学报(地球科学版), 2017, 47(1): 14-24.
[2] 姜传金,陈树民,刘财,鹿琪,冯智慧. 基于Wigner双谱对角切片的谱分解技术在油气检测中的应用[J]. 吉林大学学报(地球科学版), 2013, 43(3): 1013-1021.
[3] 陈永良,李学斌. 核概率距离聚类方法及应用[J]. 吉林大学学报(地球科学版), 2013, 43(1): 312-318.
[4] 杨勇强, 邱隆伟, 孙宝强, 付育璞. 微观组分聚类分析在物源体系恢复中的应用——以民丰洼陷沙三段中亚段为例[J]. J4, 2012, 42(1): 30-38.
[5] 陈永良, 李学斌. 基于核最小二乘模型的矿产靶区预测[J]. J4, 2011, 41(3): 937-944.
[6] 苗长盛, 董清水, 张旗, 崔大勇, 许圣传, 王敏雪. 储层流动单元研究在油田老区挖潜中的应用--以吉林油田乾146区块开发为例[J]. J4, 2011, 41(1): 39-45.
[7] 王伟, 张永波, 叶浩, 郭娇, 王乾, 张礼中, 蔡子昭, 梁国玲, 张春英. 内蒙古砒砂岩的模糊聚类分析[J]. J4, 2009, 39(6): 1168-1172.
[8] 李军辉,卢双舫,柳成志,李笑梅,苏鹤成,杨雨. 锦45块储层流动单元[J]. J4, 2009, 39(2): 190-0195.
[9] 邓小英,李月. 基于Ricker子波核的支持向量回归方法及其在地震勘探记录去噪处理中的应用[J]. J4, 2007, 37(4): 821-0827.
[10] 曾道明,纪宏金,陈 满,胡大千,朱永正. 胶东山城金矿地质与地球化学变量的关系[J]. J4, 2006, 36(04): 511-515.
[11] 纪宏金,孙丰月,陈满,胡大千,时艳香,潘向清. 胶东地区裸露含金构造的地球化学评价[J]. J4, 2005, 35(03): 308-0312.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!