基于超图正则化受限的概念分解算法

李雪; 赵春霞; 舒振球; 郭剑辉

doi:10.11999/JEIT140799

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名

邮箱

手机号码

标题

留言内容

验证码

基于超图正则化受限的概念分解算法

doi: 10.11999/JEIT140799 cstr: 32379.14.JEIT140799

李雪^{* 赵春霞舒振球郭剑辉},
赵春霞,
舒振球,
郭剑辉

基金项目:

国家自然科学基金(61272220, 61101197, 90820306)，中国博士后科学基金(2014M551599)，江苏省社会安全图像与视频理解重点实验室基金(30920130122006)和江苏省普通高校研究生科研创新计划项目(KYLX_0383)资助课题

计量
- 文章访问数: 2412
- HTML全文浏览量: 227
- PDF下载量: 1106
- 被引次数: 0
出版历程
- 收稿日期: 2014-06-17
- 修回日期: 2014-10-15
- 刊出日期: 2015-03-19

Hyper-graph Regularized Constrained Concept Factorization Algorithm

Li Xue^{* 赵春霞舒振球郭剑辉},
Zhao Chun-Xia,
Shu Zhen-Qiu,
Guo Jian-Hui

摘要

摘要: 针对概念分解(Concept Factorization, CF)算法没有同时考虑样本中存在的类别信息及数据间多元几何结构信息的问题，该文提出一种基于超图正则化受限的概念分解(Hyper-graph regularized Constrained Concept Factorization, HCCF)算法。HCCF算法通过构建一个无向加权的拉普拉斯超图正则项，提取数据间的多元几何结构信息，克服了传统图模型只能表达数据间成对关系的缺陷；同时采用硬约束的方式使样本的类别信息在低维空间中保持一致，充分利用了标记样本的类别信息。该文采用乘性迭代的方法求解HCCF算法的目标函数并证明了其收敛性。在TDT2库、Reuters库和PIE库上的实验结果表明，HCCF算法提高了聚类的准确率和归一化互信息，验证了算法的有效性。
- 信息处理 /
- 概念分解 /
- 聚类 /
- 硬约束 /
- 超图 /
- 流形学习
Abstract: The Concept Factorization (CF) algorithm can not take into account the label information and the multi-relationship of samples simultaneously. In this paper, a novel algorithm called Hyper-graph regularized Constrained Concept Factorization (HCCF) is proposed, which extracts the multi-geometry information of samples by constructing an undirected weighted hyper-graph Laplacian regularize term, hence overcomes the deficiency that traditional graph model expresses pair-wise relationship only. Meanwhile, HCCF takes full advantage of the label information of labeled samples as hard constraints, and it preserves label consistent in low-dimensional space. The objective function of HCCF is solved by the iterative multiplicative updating algorithm and its convergence is also proved. The experimental results on TDT2, Reuters, and PIE data sets show that the proposed approach achieves better clustering performance in terms of accuracy and normalized mutual information, and the effectiveness of the proposed approach is verified.
- Information processing /
- Concept Factorization(CF) /
- Cluster /
- Hard constraints /
- Hyper-graph /
- Manifold learning