J4 ›› 2010, Vol. 28 ›› Issue (03): 309-.

• 论文 • 上一篇    下一篇

KNN算法的数据优化策略

王新颖1a,1b,隽志才1b,2,吴庆妍1c,孙 元1c   

  1. 1.吉林大学 a.计算机科学与技术学院;b. 交通学院;c. 计算机教学与研究中心, 长春 130012;
    2.上海交通大学 安泰经济与管理学院交通研究所, 上海 200030
  • 出版日期:2010-05-30 发布日期:2010-06-12
  • 通讯作者: 吴庆妍(1959— ),女,长春人,吉林大学工程师,主要从事智能控制研究(Tel)86-13604414493 E-mail:wuqy@jlu.edu.cn
  • 作者简介:王新颖(1977— )|女|辽宁丹东人| 吉林大学博士研究生|主要从事智能控制研究|(Tel)86-13504410283(E-mail)xinying@jlu.edu.cn;隽志才(1954— )|男|吉林公主岭人|上海交通大学教授|博士生导师|主要从事交通仿真与智能控制研究(Tel)86-21-52301396(E-mail)juanzhicai@163.com;通讯作者:吴庆妍(1959— )|女|长春人|吉林大学工程师|主要从事智能控制研究(Tel)86-13604414493(E-mail)wuqy@jlu.edu.cn

Data Optimization Strategy of KNN Algorithm

WANG Xin-ying1a,1b,JUAN Zhi-cai1b,2,WU Qing-yan1c,SUN Yuan1c   

  1. 1a. College of Computer Science and Technology|1b. College of Traffic;1c. Center for Computer Fundamental Education, Jilin University,Changchun 130012,China;2. Institute of Transportation Studies, Antai College of Economics &|Management,Shanghai Jiaotong University, Shanghai 200030,China
  • Online:2010-05-30 Published:2010-06-12

摘要:

为了解决基于KNN(K-Nearest Neighbors)算法的非参数回归短时交通状态预测模型执行效率低的问题,提出了KNN算法的数据优化策略。通过对交通状态时空特性的研究,采用层次化对象构造交通状态向量,并根据交通状态的自重复性对历史样本数据库进行数据压缩。实验证明,优化策略提高了KNN算法的执行效率,经过压缩后的数据存取时间比压缩前缩短了8.66%。

关键词: 非参数回归, 短时交通状态预测, KNN算法, 层次化对象, 自重复性

Abstract:

In order to resolve the inefficient of the nonparametric-regressive model for short-term traffic state forecasting based on KNN(K-Nearest Neighbor)algorithm, the paper presents a data optimization strategy of KNN algorithm. Using time and space characteristics of traffic state, the author constructs traffic state vector with hierarchical object structure, and compresses the historical sample database because of the self-repeatability of traffic state. Experiment shows that the optimization strategy of database improves the efficiency of KNN algorithm.After the compressed data access time is shorter 8.66% than the pre-compressed.

Key words: nonparametric-regressive, short-term traffic state forecast, K-nearest neighbors(KNN) algorithm, hierarchical object, self-repeatability

中图分类号: 

  • TP391