[1] |
Garry Turkington.Hadoop beginner’s guide[M].UK:Packt Publishing,2013.
|
[2] |
Zhou Aiwu,Cheng Bo,Li Sunchang,et al.Session identification methods in web log mining[J].Computer Engineering & Design,2010,31(5):936964.(in Chinese)
|
[3] |
Spiliopoulou M,Mobasher B,Berendt B,et al.A framework for the evaluation of session reconstruction heuristics in web usage analysis [J].Informs Journal of Computing,2003,15 (2):171179.
|
[4] |
Dai Zhili,Wang Xinyu.A session identification method based on dynamic time threshold[J].Computer Applications and Software,2010,27(2):244246.(in Chinese)
|
[5] |
Zhu Peng,Zhao Mingsheng.Session identification algorithm for web log mining[C]∥Proc of 2010 International Conference on Management and Service Science (MASS),2010:14.
|
[6] |
Zheng Lishan,Teng Shaohua.Improved session identification method based on page and time threshold[J].Computer Applications and Software,2012,29(10):197275.(in Chinese)
|
[7] |
Romn PE, Dell R F, Velsquez J D. Advanced techniques in web data preprocessing and cleaning[C]∥Advanced Techniques in Web Intelligence1,2010:1948.
|
[8] |
Zhou Aiwu,Cheng Bo.An improved session identification method in web log mining[J].Microcomputer and Its Application,2010,29(15):7176.(in Chinese)
|
[9] |
Li Yan,Feng Boqin,Lu Xiaofeng.Data preprocessing technology in web log mining[J].Computer Engineering,2009,35(22):4446.(in Chinese)
|
[10] |
Sadagopan N,Li J.Characterizing typical and atypical user sessions in clickstreams[C]∥Porc of the 17th International Conference on World Wide Web,2008:885894.
|
[11] |
Zhang Shuai,Chen Xingshu,Tong Hao,et al.The session identification method based on reference heuristic and url semantics[J].Computer Application Research,2014,31(1):102105.(in Chinese)
|
[12] |
Li Jianjiang,Cui Jian,Wang Dan,et al.The research reviewed of MapReduce parallel programming model[J].Acta Electronica Sinica,2011,39(11):26352642.(in Chinese)
|
[13] |
Web log format[EB/OL].[20100223].http://webdataanalysis.net/referenceandsource/weblogformat/.(in Chinese)
|
[14] |
Fu Y J,Sandhu K,Shih M Y.A generalizationbased approach to clustering of web usage sessions[C]∥Proc of Revised Papers from the International Workshop on Web Usage Analysis and User Profiling,2000:2138.
|
|
附中文参考文献:
|
[2] |
周爱武,程博,李孙长,等.Web日志挖掘中的会话识别方法Table 3comparison chart of experiment results among different session identification methods表3不同会话识别方法实验结果对比会话识别方法会话数正确构建会话数精确度/%查全度/%时间/s基于固定访问间隔阈值方法(R)|R|=1925|R∩R| = 1925100.00100.0016.7基于会话时长时间阈值方法(R1)|R1|=2193|R1∩R|= 85338.9044.3116.4以首页为参引页方法(R2)|R2|=2469|R2∩R|= 101841.2352.8917.5基于语义信息方法(R3)|R3|=2052|R3∩R|= 79138.5441.0925.8基于网络拓扑和动态阈值方法(R4)|R4|=2835|R4∩R|= 170460.1188.5218.1[J].计算机工程与设计,2010,31(5):936964.
|
[4] |
戴智丽,王鑫昱.一种基于动态时间阈值的会话识别方法[J].计算机应用与软件,2010,27(2):244246.
|
[6] |
郑立山,滕少华.改进的页面与时间阈值的会话识别法[J].计算机应用与软件,2012,29(10):197275.
|
[8] |
周爱武,程博.日志挖掘中一种改进的会话识别方法[J].微型机与应用,2010,29(15):7176.
|
[9] |
李燕,冯博琴,鲁晓峰.Web日志挖掘中的数据预处理技术[J].计算机工程,2009,35(22):4446.
|
[11] |
张帅,陈兴蜀,童浩,等.基于引用启发式和URL语义相结合的会话识别方法[J].计算机应用研究,2014,31(1):102105.
|
[12] |
李建江,崔健,王聃,等.MapReduce并行编程模型研究综述[J].电子学报,2011,39(11):26352642.
|
[13] |
网站数据分析.Web日志格式[EB/OL].[20100223].http://webdataanalysis.net/referenceandsource/weblogformat/.
|