Journal of Jilin University (Information Science Edition)

Hadoop YARN Scheduling Algorithm Based on Priority Weight

WANG Rongli, HOU Xiuping

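  1. School of Computer Science and Engineering, Changchun University of Technology, Changchun 130012
  • Received: 2016-07-28 Online: 2017-09-29 Published: 2017-10-23
  • Corresponding author: HOU Xiuping (born 1964), female, from Changchun, professor and master's supervisor at Changchun University of Technology; research interest: software testing. (Tel) 86-18643194774, (E-mail) houxiuping@mail.ccut.edu.cn
  • About the author: WANG Rongli (born 1991), female, from Jilin City, Jilin Province, master's student at Changchun University of Technology; research interests: software testing and big data. (Tel) 86-15844027280, (E-mail) wangrongli1217@163.com
  • Funding:
     Science and Technology Achievements Transformation Fund of the Jilin Provincial Department of Science and Technology (20130303010GX)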

Hadoop YARN Scheduling Algorithm Based on Priority Weight

WANG Rongli, HOU Xiuping   

  1. School of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China
  • Received:2016-07-28 Online:2017-09-29 Published:2017-10-23

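Abstract: Hadoop's existing schedulers cannot allocate resources according to how urgent a task is. To solve this
problem, the resource scheduling mechanism in YARN is studied, the original scheduler (Capacity Scheduler) is
improved, and a Hadoop YARN (Yet Another Resource Negotiator) scheduling algorithm based on priority weight
(Weight Scheduler Based on Priority) is proposed. A queue priority is set for each leaf queue, and a queue is
selected by combining its resource utilization with its queue priority. The initial weight of an application is set to
the value of its application priority, the waiting time determines whether the weight is updated, the applications in
the queue are sorted by weight, and resources are allocated first to applications with higher weight. Experimental
results show that, compared with the original scheduling algorithm, the improved algorithm reduces the average task
execution time by about 141 s and the average waiting time by 34.5%, which keeps task execution relatively fair
among users and improves overall user satisfaction.

Keywords: resource scheduler, Hadoop YARN framework, priority weight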

Abstract: The existing Hadoop schedulers cannot allocate resources according to the urgency of tasks. To solve
this problem, we study the resource scheduling mechanism in YARN (Yet Another Resource Negotiator), improve
the original Capacity Scheduler, and propose a Hadoop YARN scheduling algorithm based on priority weight, the
Weight Scheduler Based on Priority. First, a queue priority is set for each leaf queue, and a queue is selected by
combining its resource utilization with its queue priority. Second, the initial weight of each application is set to
its priority, the waiting time is used to judge whether to update the weight, and the applications in the selected
queue are sorted by weight so that applications with higher weight are allocated resources first. The experiment
shows that, compared with the original scheduling algorithm, the improved algorithm reduces the average task
execution time by about 141 s and the average waiting time by 34.5%, which keeps task execution relatively fair
among users and improves overall user satisfaction.

Key words: Hadoop YARN (Yet Another Resource Negotiator), resource scheduler, priority weight
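
The two-level scheme described in the abstract (pick a leaf queue from its priority and resource utilization, then order the applications in that queue by a weight that starts at the application priority and may be updated according to waiting time) can be illustrated with a minimal Python sketch. This is not the authors' implementation and does not use the YARN API. The abstract does not give the formulas, so the queue score `priority * (1 - utilization)`, the constants `WAIT_THRESHOLD_S` and `WEIGHT_STEP`, and all class and function names below are assumptions made purely for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical tuning constants; the abstract does not state actual values.
WAIT_THRESHOLD_S = 60.0   # waiting time after which a weight is updated
WEIGHT_STEP = 1.0         # weight gained per elapsed threshold interval


@dataclass
class App:
    name: str
    priority: int          # application priority
    submit_time: float     # submission time in seconds
    weight: float = field(init=False)

    def __post_init__(self) -> None:
        # From the abstract: the initial weight equals the application priority.
        self.weight = float(self.priority)


@dataclass
class LeafQueue:
    name: str
    priority: int          # queue priority
    used: float            # resources currently in use
    capacity: float        # resources configured for the queue
    apps: List[App] = field(default_factory=list)

    @property
    def utilization(self) -> float:
        return self.used / self.capacity


def select_queue(queues: List[LeafQueue]) -> Optional[LeafQueue]:
    """Pick a non-empty queue with spare capacity (assumed scoring rule)."""
    candidates = [q for q in queues if q.apps and q.utilization < 1.0]
    if not candidates:
        return None
    # Assumption: favor high queue priority and low resource utilization.
    return max(candidates, key=lambda q: q.priority * (1.0 - q.utilization))


def refresh_weights(queue: LeafQueue, now: float) -> None:
    """Raise the weight of applications that waited past the threshold."""
    for app in queue.apps:
        waited = now - app.submit_time
        if waited > WAIT_THRESHOLD_S:
            # Assumption: linear aging on top of the initial priority.
            app.weight = app.priority + WEIGHT_STEP * (waited // WAIT_THRESHOLD_S)


def next_app(queues: List[LeafQueue], now: float) -> Optional[App]:
    """Return the application that would be allocated resources next."""
    queue = select_queue(queues)
    if queue is None:
        return None
    refresh_weights(queue, now)
    # Highest weight first; earlier submission breaks ties.
    queue.apps.sort(key=lambda a: (-a.weight, a.submit_time))
    return queue.apps[0]
```

Calling `next_app(queues, now)` with a list of `LeafQueue` objects returns the application at the head of the chosen queue. Under these assumed rules, a low-priority application that has waited long enough can overtake a newer high-priority one, which is one way to obtain the relative fairness the abstract reports.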

CLC number:

  • TP391