• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2013, Vol. 35 ›› Issue (6): 24-29.

• 论文 • 上一篇    下一篇

分布式软件系统运行时监测框架研究

刘东红,邹鹏   

  1. (国防科学技术大学计算机学院,湖南 长沙 410073)
  • 出版日期:2013-06-25 发布日期:2013-07-03
  • 基金资助:

    国家自然科学基金资助项目(90818028,61202117)

Research on framework for
distributed software runtime monitoring          

LIU Donghong,ZOU Peng   

  1. (School of Computer Science,National University of Defense Technology,Changsha 410073,China)
  • Online:2013-06-25 Published:2013-07-03

摘要:

软件监测是评价大规模分布式系统行为是否符合预期的重要手段,也是进一步实施在线调整和演化活动的前提和基础。给出了一个名为Mega的分布式软件系统运行时监测框架,可以支持具备监测能力的分布式软件系统的开发和运行。一方面,该框架通过引入一种专门的监测需求描述语言,在分布式系统中实现了监测逻辑和功能逻辑的分离,使得监测逻辑可以高效开发和独立维护。另一方面,该框架通过引入为监测数据处理而设计的事件模型,使得大规模系统的监测具备可伸缩和可扩展性。在阐述Mega框架特点的基础上,进一步给出了基于这一框架的若干监测设计模式,可以有效支持具备监测能力的大规模系统开发。

关键词: 运行时监测, 框架, 分布式软件

Abstract:

Software monitoring is an important means to check whether a largescale distributed system complies with its intended behavior, which is the foundation of further online adjustment and evolution activities. This paper proposes a framework named Mega that supports the development and running of monitoringenabled distributed software systems. On the one hand, a dedicated language is introduced into the framework to describe where and what should be monitored at runtime, which separates the monitoring logic and the functional logic and enables the former to be developed efficiently and maintained independently. On the other hand, an eventbased model for monitoring data processing is introduced, which is scalable and extensible in largescale distributed systems. Based on the description of the feature of the Mega framework, this paper presents a set of monitoring design patterns as well, which can support the development of largescale distributed systems with monitoring capability.

Key words: runtime monitoring;software framework;distributed software