吉林大学学报(理学版)

• 计算机科学 • 上一篇    下一篇

分布式流处理系统中基于位置感知的操作共享优化算法

魏晓辉, 刘晓斐, 李洪亮, 李兰馨, 李丽娜, 崔浩龙, 李想   

  1. 吉林大学 计算机科学与技术学院, 长春 130012
  • 收稿日期:2016-03-11 出版日期:2016-09-26 发布日期:2016-09-19
  • 通讯作者: 李洪亮 E-mail:lihongliang@jlu.edu.cn

Operation Sharing Optimization AlgorithmBased on Location Aware in DSPS

WEI Xiaohui, LIU Xiaofei, LI Hongliang, LI Lanxin, LI Lina, CUI Haolong, LI Xiang   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received:2016-03-11 Online:2016-09-26 Published:2016-09-19
  • Contact: LI Hongliang E-mail:lihongliang@jlu.edu.cn

摘要:

针对流数据规模大、 基本数据处理操作有重叠等特点, 提出一种基于位置感知的操作共享优化算法. 假设操作预先分配了计算节点资源, 在保证流处理实时性和动态性的基础上, 利用多个流数据作业间拓扑结构和操作功能的相似性, 基于DAG图匹配方法建立操作共享备选集; 以输出带宽为约束条件, 建立一个操作共享收益优化
问题模型, 解决了流处理过程中传统操作共享方法对集群的输出带宽影响考虑不充分的问题. 仿真实验结果表明, 该方法较原有算法更充分地考虑了计算节点制约因素, 并提高了共享收益, 从而可有效地节省系统计算资源.

关键词: 流数据, 分布式流处理, 操作共享, 共享优化

Abstract:

We proposed an operation sharing optimization algorithm based on the location aware according to the data flow in large scale, and the basic data processing operations had overlapping features. We assumed that the operations had assigned the resources of compute nodes in advance, in guaranteed the stream processing realtime and dynamic state, to take advantage of the similarity between topologies and operational functions of multiple stream data operations. We established an operation sharing revenue optimized model based on the DAG matching method on the constraint of output bandwidth. This method tackled the insufficiency consideration of traditional operation for operation sharing method to the cluster of output bandwidth. Simulation experiment results show that this method has fuller consideration in the calculation of the node constraints and improves the benefits of sharing, thereby saving the computing resources of the system effectively.

Key words: stream data, distributed stream processing, operation sharing, sharing optimization

中图分类号: 

  • TP391