一种分层组合的半监督近邻传播聚类算法
doi: 10.3724/SP.J.1146.2012.00673
Semi-supervised Affinity Propagation Clustering Algorithm Based on Stratified Combination
-
摘要: 针对近邻传播(AP)聚类算法的计算复杂度和准确性,该文提出一种分层组合的半监督近邻传播聚类算法(SAP-SC)。算法引入分层聚类的思想,将一次AP聚类过程等分成若干层聚类,使得处理过程简单、易于实现;每层只关注聚类困难的数据点,并通过构造成对点约束和使用子簇标签映射进行半监督学习;基于组合提升的方法将各层聚类结果加权叠加,从而提升了算法的准确性能。理论分析和实验结果表明:算法在聚类准确性和计算复杂度方面有了较大改进。Abstract: Considering the complexity and the accuracy, an improved affinity propagation clustering algorithm Semi-supervised Affinity Propagation clustering algorithm based on Stratified Combination (SAP-SC) is proposed. In order to make the operation simplified and easily-implemented, the proposed algorithm introduces a stratified clustering method which equally partitions the integrative clustering process into several smaller blocks. Focusing on the hard clustering data, every layer employs semi-supervised learning to conceive pair-wise constraints and maps each sub-cluster with the corresponding label. Also, assembled boosting method is utilized to weight together all layered results to improve the clustering performance. Finally, theoretical analysis and experimental results show that the algorithm can achieve both higher accuracy and better computational performance.
计量
- 文章访问数: 2400
- HTML全文浏览量: 111
- PDF下载量: 1094
- 被引次数: 0