Zhang Zhen, Wang Bin-Qiang, Chen Hong-Chang, Ma Hai-Long. Internet Traffic Classification Based on Host Connection Graph[J]. Journal of Electronics & Information Technology, 2013, 35(4): 958-964. doi: 10.3724/SP.J.1146.2012.01040
Citation:
Zhang Zhen, Wang Bin-Qiang, Chen Hong-Chang, Ma Hai-Long. Internet Traffic Classification Based on Host Connection Graph[J]. Journal of Electronics & Information Technology, 2013, 35(4): 958-964. doi: 10.3724/SP.J.1146.2012.01040
Zhang Zhen, Wang Bin-Qiang, Chen Hong-Chang, Ma Hai-Long. Internet Traffic Classification Based on Host Connection Graph[J]. Journal of Electronics & Information Technology, 2013, 35(4): 958-964. doi: 10.3724/SP.J.1146.2012.01040
Citation:
Zhang Zhen, Wang Bin-Qiang, Chen Hong-Chang, Ma Hai-Long. Internet Traffic Classification Based on Host Connection Graph[J]. Journal of Electronics & Information Technology, 2013, 35(4): 958-964. doi: 10.3724/SP.J.1146.2012.01040
Considering at the concept drift issue of machine learning identification, a novel algorithm called traffic classification based on Host Connection Graph (HCG) is proposed. Considering {IP Address, Port} as the unique user identifier, HCG constructs a host connection graph and innovates the concept of user similarity. Based on the theory of graph mining, social community is abstracted from communications among hosts by partitioning the graph into mutually intersectant behavior clusters. In order to reach traffic classification, HCG not only conceives a definition called User Behavior Mode (UBM) to analyse the implicit traffic characteristics, but also maps application labels to every host behavior by employing UBM and Port. Finally, simulations are conducted based on the real network trace. Results demonstrate that HCG can circumvent the concept shift problem and ameliorate gracefully computational complication without sacrificing accuracy.