• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Dynamic mining of sensitive data streams in
heterogeneous complex information networks

XIONG Ju-xia1,2,3,WU Jin-zhao1,2,3
  

  1. (1.Chengdu  Institute of Computer Application,Chinese Academy of Sciences,Chengdu  610041;
    2.University of Chinese Academy of Sciences,Beijing 100049;
    3.Guangxi Key Laboratory of Hybrid Computational and IC Design Analysis,
    Guangxi University for Nationalities,Nanning 530006,China)
  • Received:2019-06-13 Revised:2019-10-09 Online:2020-04-25 Published:2020-04-25

Abstract:

For the sensitive data streams with high-dimensional redundancy in heterogeneous complex information networks, the probability of data feature formation is low, which leads to multiple mining, high memory usage, low mining accuracy and long running time. Aiming at the above problems, a dynamic network sensitive data stream mining method based on the maximum inter-class divergence is proposed. The maximum difference interval between sensitive data is used as the basis for classification to obtain the maximum inter-class divergence of the network sensitive data. The optimal divergence iterative function is determined in the genetic iterative state. The mining characteristics of the iterative function are preferably selected to obtain the dynamic mining characteristics. Clustering analysis is performed on the mining characteristics to obtain data hiding information modes. These modes are evaluated, and knowledge representation is carried out on the reasonable information modes, so as to realize the dynamic mining of the sensitive data streams in the heterogeneous complex information networks. The experimental results show that the mineable feature formation probability of the method can be up to 98%, and the labels are close to the actual values. The method has the advantages of high mining accuracy, short running time and low memory usage.
 

Key words: heterogeneous complex information network, sensitive data stream, dynamic mining, divergence iterative function, clustering analysis