Journal of Jilin University (Information Science Edition)


基于时频域特征的场景音频研究

张 勇1a,1b ,张 溯1a,王旭东2 ,路 阳3,1b ,王 臣1a   

  1. 东北石油大学 a. 电子科学学院; b. 黑龙江省网络化与智能控制重点实验室,黑龙江 大庆 163318;2. 大庆油田有限责任公司第一采油厂 仪表安装维修大队,黑龙江 大庆 163453;3. 黑龙江八一农垦大学 电气与信息学院,黑龙江 大庆 163319
  • 收稿日期:2018-01-17 出版日期:2018-05-24 发布日期:2018-07-25
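  • About the author: ZHANG Yong (1974-), male, born in Nong'an, Jilin; associate professor and master's supervisor at Northeast Petroleum University; mainly engaged in research on signal and information processing. (Tel) 86-13604594911 (E-mail) dqpizy@163.com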
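  • Supported by:
    National Natural Science Foundation of China (61374127; 61422301); Heilongjiang Province Outstanding Youth Science Fund (JC2015016); China Postdoctoral Science Foundation (2016M591560); Heilongjiang Provincial Government Postdoctoral Fund (LBH-Z15185); Heilongjiang Province Postdoctoral Scientific Research Start-up Fund (103679)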

Scene Audio Research Based on Time-Frequency Domain Characteristics

ZHANG Yong1a,1b ,ZHANG Su1a ,WANG Xudong2 ,LU Yang3,1b ,WANG Chen1a   

  1. 1a. College of Electronic Sciences; 1b. Heilongjiang Province Network and Intelligent Control Key Laboratory,Northeast Petroleum University,Daqing 163318,China; 2. Instrument Maintenance Battalion,No. 1 Oil Provide Factory of Daqing Oilfield Limited Company,Daqing 163453,China; 3. College of Electrical and Information, Heilongjiang Bayi Agricultural University,Daqing 163319,China
  • Received: 2018-01-17 Online: 2018-05-24 Published: 2018-07-25

摘要: 随着人们对于场景音频研究的逐渐深入,现有的分析方式由于存在不能完整反映音频的声学特性等弊端,已经无法满足人们的需求。基于时频域特征的分析方式可以很好地解决这一问题,即通过提取场景音频的语谱图,使待分析信号中包含的声学事件得到完整保留,使其表现得更加直观。语谱图中包含着丰富的纹理信息,选取不同窗长,可分别得到场景音频的宽带语谱图和窄带语谱图。对比实验表明,窄带语谱图可以更好的反映出待分析信号中所包含声学事件的趋势、连续性及分布特征。因此对场景音频进行时频域特征分析更适合使用窄带语谱图。

关键词: 语谱图, 场景音频, 窗函数, 窄带语谱图

Abstract: As research on scene audio deepens, existing analysis methods can no longer meet researchers' needs, because they fail to fully reflect the acoustic characteristics of the audio. Analysis based on time-frequency domain characteristics addresses this problem: extracting the spectrogram of the scene audio fully retains the acoustic events contained in the signal under analysis and presents them more intuitively. The spectrogram of scene audio contains rich texture information, and choosing different window lengths yields the wide-band spectrogram and the narrow-band spectrogram of the scene audio, respectively. Comparison experiments show that the narrow-band spectrogram better reflects the trend, continuity, and distribution characteristics of the acoustic events in the signal under analysis, and therefore characterizes them better than the wide-band spectrogram. The narrow-band spectrogram is thus the more suitable choice for time-frequency domain analysis of scene audio.

Key words: window function, narrow-band spectrogram, spectrogram, scene audio
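
The window-length comparison described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the Hamming window, the 50% overlap, the 4 ms and 32 ms window lengths, the 8 kHz sampling rate, the synthetic two-tone test signal, and the helper name `spectrogram`. The sketch only shows the general trade-off: a short window gives a wide-band spectrogram (fine time resolution, coarse frequency bins), while a long window gives a narrow-band spectrogram (coarse time resolution, fine frequency bins).

```python
import numpy as np


def spectrogram(x, fs, win_ms, overlap=0.5):
    """Magnitude short-time Fourier transform (hypothetical helper).

    x       : 1-D signal
    fs      : sampling rate in Hz
    win_ms  : window length in milliseconds (assumed values, not from the paper)
    overlap : fraction of each window shared with the next one
    """
    n = int(round(fs * win_ms / 1000))       # window length in samples
    hop = max(1, int(n * (1 - overlap)))     # step between frame starts
    window = np.hamming(n)                   # assumed window function
    starts = range(0, len(x) - n + 1, hop)
    frames = np.stack([x[s:s + n] * window for s in starts])
    mag = np.abs(np.fft.rfft(frames, axis=1))   # shape: (num_frames, n // 2 + 1)
    freqs = np.fft.rfftfreq(n, d=1 / fs)        # bin centre frequencies in Hz
    times = (np.array(starts) + n / 2) / fs     # frame centre times in seconds
    return freqs, times, mag


# Synthetic stand-in for a scene recording: one second of two steady tones.
fs = 8000
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 1000 * t) + np.sin(2 * np.pi * 1200 * t)

# Short window: wide-band spectrogram (many frames, few frequency bins).
f_wide, t_wide, S_wide = spectrogram(x, fs, win_ms=4)

# Long window: narrow-band spectrogram (few frames, many frequency bins).
f_narrow, t_narrow, S_narrow = spectrogram(x, fs, win_ms=32)

print("wide-band   bin spacing (Hz):", f_wide[1] - f_wide[0], "frames:", len(t_wide))
print("narrow-band bin spacing (Hz):", f_narrow[1] - f_narrow[0], "frames:", len(t_narrow))
```

The frequency bin spacing equals the sampling rate divided by the window length in samples, so lengthening the window narrows the analysis bandwidth at the cost of fewer, longer frames. Which window lengths suit a given scene recording is an empirical question that this sketch does not settle.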

CLC number: 

  • TP391.42