Journal of Jilin University (Information Science Edition) ›› 2019, Vol. 37 ›› Issue (4): 457-462.


Design of Stac Course Database Automatic Retrieval System Based on Scene Theory

LI Shujun1a, ZHANG Hongjie2, WANG Haitang1b, WANG Qiushuang3

  1a. Party School Work Department, Training Center, State Grid Hebei Electric Power Company Limited; 1b. Training Center, State Grid Hebei Electric Power Company Limited, Shijiazhuang 050023, China; 2. Beijing Minxing Pioneering International Management Consulting Co., Ltd., Beijing 101100, China; 3. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Online: 2019-07-24 Published: 2019-12-16
  • About the author: LI Shujun (born 1969), male, from Gaobeidian, Hebei, is a senior lecturer at State Grid Hebei Electric Power Company Limited whose main research area is Party school education in state-owned enterprises. (Tel) 86-18031130612, (E-mail) lishujun20156@163.com.
  • Supported by:
    Undergraduate Teaching Reform Research Fund of Jilin University (2017XYB070)

Abstract: Traditional course database retrieval systems have poor recall and are affected by noise, so their retrieval accuracy is low and they cannot meet users' needs for retrieving the Stac (Statistical Analysis) course database. To address this, an automatic retrieval system for the Stac course database based on scene theory is designed. Under scene theory, the overall design of the automatic retrieval system is laid out, and a word segmentation module is added that uses combinational-ambiguity statistics to distinguish synonyms and polysemous words in the Stac course database. A web spider finds web page link addresses, reads their content, and retrieves all target addresses. Once the volume collected reaches a certain scale, several independent search engines are invoked and cooperate to build an index library; data are collected according to the Stac course resource data specification, and the index engine loads all collected results into the system. By identifying the characteristics of the scene, a CD-ROM database is established and the retrieval process is designed; the behavior of each machine is closely monitored to avoid noise interference, and after processing by the central DB (Database) Server the address lists are merged into a new resource list for users to search. Experimental results show that the retrieval accuracy of the system reaches up to 98%, providing system support for multi-image retrieval.

Key words: scene theory, statistical analysis (Stac) course, database, automatic retrieval, index, engine
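
The abstract's final processing step, in which the central DB Server merges the address lists into a new resource list, can be illustrated with a minimal sketch. The abstract does not describe how the merge is done, so everything here is an assumption made for illustration: the function name `merge_address_lists`, the example URLs, deduplication by exact string match after whitespace stripping, and keeping first-seen order.

```python
from typing import Iterable, List


def merge_address_lists(lists: Iterable[Iterable[str]]) -> List[str]:
    """Merge per-engine address lists into one list without duplicates.

    Order of first appearance is preserved. Normalisation is limited to
    whitespace stripping; that choice is an assumption, not the paper's rule.
    """
    seen = set()
    merged = []
    for addresses in lists:
        for address in addresses:
            key = address.strip()
            # Skip empty entries and addresses another engine already returned.
            if key and key not in seen:
                seen.add(key)
                merged.append(key)
    return merged


# Hypothetical address lists returned by three independent engines.
engine_a = ["http://example.org/stac/1", "http://example.org/stac/2"]
engine_b = ["http://example.org/stac/2", "http://example.org/stac/3"]
engine_c = ["http://example.org/stac/1 ", ""]

resource_list = merge_address_lists([engine_a, engine_b, engine_c])
print(resource_list)
```

A set gives constant-time duplicate checks while the separate list keeps the order in which addresses were first collected; a real system would likely need stronger URL normalisation than this sketch applies.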

CLC Number:

  • TP391.1