• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (07): 1209-1215.

• Computer Network and Znformation Security • Previous Articles     Next Articles

A low-frequency log noise filtering method in business process based on string matching algorithm

HE Zi-xian,FANG Xian-wen   

  1. (School of Mathematics and Big Data,Anhui University of Science & Technology,Huainan 232001,China)
  • Received:2022-01-21 Revised:2022-06-13 Accepted:2023-07-25 Online:2023-07-25 Published:2023-07-11

Abstract: The process mining field focuses on the analysis of data generated by business process execution, aiming to extract operational process knowledge from the data. However, there may be some noise in the low-frequency logs of the model, which may negatively affect the analysis. Therefore, a method based on frequency change rules and string matching is proposed to identify and filter noise from low-frequency event logs. Firstly, based on the directly-follows graph and the eventually-follows graph, invalid direct activity pairs are identified from the event log sequence set according to frequency change rules. Then, combined with an improved string matching algorithm (KMP), the invalid activity sequences are matched with the low-frequency log traces based on the correspondence between the direct relationship of the direct-follow graph and the sequence fragment of the event log, thus filtering the noise in the log and optimizing the mining model. Finally, the effectiveness of the method is verified through specific case analysis and simulation experiments.

Key words: noise, directly-follows graph, eventually-follows graph, KMP, filtering, optimization ,