[1] He Shijia, Liu Xiaoqiang, Li Baiyan, et al. Design and Implementation of Website Change Monitoring and Early Warning System[J]. Journal of Nanjing Normal University (Engineering and Technology), 2021, 21(01): 030-35. [doi:10.3969/j.issn.1672-1292.2021.01.005]

Design and Implementation of Website Change Monitoring and Early Warning System

Journal of Nanjing Normal University (Engineering and Technology) [ISSN:1006-6977/CN:61-1281/TN]

Volume:
21
Issue:
2021, No. 01
Pages:
030-35
Section:
Computer Science and Technology
Publication date:
2021-03-15

Article Info

Title:
Design and Implementation of Website Change Monitoring and Early Warning System
Article ID:
1672-1292(2021)01-0030-06
Author(s):
He Shijia (1, 2), Liu Xiaoqiang (1), Li Baiyan (1), Cai Lizhi (2), Hu Yun (2)
(1. College of Computer Science and Technology, Donghua University, Shanghai 201620, China)
(2. Shanghai Key Laboratory of Computer Software Testing and Evaluating, Shanghai 201112, China)
Keywords:
website content tampering; website change monitoring; MD5; text comparison algorithm; distributed storage; message mechanism
CLC number:
TP391.1
DOI:
10.3969/j.issn.1672-1292.2021.01.005
Document code:
A
Abstract:
Websites are frequent targets of intrusion and tampering, so real-time monitoring of website changes is particularly important for website security. To address the difficulty of monitoring website changes in real time at large scale, we design and implement a website change monitoring system based on a non-relational database and a message mechanism. The system uses crawler technology to crawl web pages in real time, and analyzes multiple websites in real time through distributed data storage and the message mechanism. An algorithm combining MD5 values with text comparison detects changes in website content, and the results are visualized in the monitoring browser. When abnormal changes occur, the system supports real-time alarm handling and emergency cut-off of the service, reducing the adverse effects of website content tampering.
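
The abstract names the detection technique only at a high level: an MD5 check combined with text comparison. The following is a minimal sketch of how such a two-stage check could look, not the authors' implementation. It assumes the page text has already been crawled and a baseline copy is stored; the function names, the returned fields, and the `alert_threshold` value are hypothetical. The text comparison uses `difflib.SequenceMatcher` from the Python standard library, which may differ from the comparison algorithm used in the paper.

```python
import difflib
import hashlib


def md5_hex(text: str) -> str:
    """Return the MD5 hex digest of the UTF-8 encoded page text."""
    return hashlib.md5(text.encode("utf-8")).hexdigest()


def detect_change(baseline: str, current: str, alert_threshold: float = 0.8) -> dict:
    """Compare a crawled page against its stored baseline.

    alert_threshold is a hypothetical cut-off, not a value from the paper.
    """
    # Stage 1: cheap digest check. Equal digests are treated as "no change",
    # so the more expensive text comparison is skipped.
    if md5_hex(baseline) == md5_hex(current):
        return {"changed": False, "similarity": 1.0,
                "removed": [], "added": [], "alert": False}

    # Stage 2: digests differ, so locate the change with a line-level comparison.
    old_lines = baseline.splitlines()
    new_lines = current.splitlines()
    matcher = difflib.SequenceMatcher(None, old_lines, new_lines, autojunk=False)

    removed, added = [], []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag in ("replace", "delete"):
            removed.extend(old_lines[i1:i2])
        if tag in ("replace", "insert"):
            added.extend(new_lines[j1:j2])

    similarity = matcher.ratio()
    return {
        "changed": True,
        "similarity": similarity,
        "removed": removed,
        "added": added,
        # Flag the page when it drifts too far from the baseline.
        "alert": similarity < alert_threshold,
    }


if __name__ == "__main__":
    old = "Welcome\nNews: campus open day\nContact us"
    new = "Welcome\nNews: HACKED\nContact us"
    print(detect_change(old, new))
```

The digest comparison keeps the common case (an unchanged page) cheap, which matters when many sites are polled continuously; the line-level comparison runs only for pages whose digest has changed and yields the removed and added lines that a monitoring view or an alarm could display.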

Memo

Received: 2020-08-08.
Corresponding author: Liu Xiaoqiang, Professor; research interests: intelligent information systems, Semantic Web, software engineering. E-mail: liuxq@dhu.edu.cn
Last Update: 2021-03-15