Audio-video synchronous coding based on motion estimation in H.264

2012, Vol. 42, Issue (05): 1321-1326.

LI Xiao-ni¹, CHEN He-xin¹, CHEN Mian-shu¹, GABBOUJ Moncef²

1. College of Communication Engineering, Jilin University, Changchun 130022, China;
2. Department of Signal Processing, Tampere University of Technology, Tampere FI-33101, Finland

Received: 2011-08-22 Online: 2012-09-01 Published: 2012-09-01
Corresponding author: CHEN He-xin (born 1949), male, professor, doctoral supervisor. Research interests: video communication and digital processing. E-mail: chx@jlu.edu.cn
Supported by:
National Natural Science Foundation of China (60832002); International Cooperation Project of the National Natural Science Foundation of China (609111301281); Graduate Innovation Fund of Jilin University (20111061); Natural Science Foundation of Jilin Province (20101515).

Abstract

An audio-video synchronous coding method is proposed that embeds the audio into the video during H.264 motion estimation, using the quarter-pixel-precision motion search. At the transmitter, following a mapping rule between the binary audio bits and the quarter-pixel search points, the compressed audio stream is embedded into the video by adjusting the best matching point in quarter-pixel motion estimation; the video stream carrying the audio is then compressed and coded. At the receiver, the audio bits are extracted from the video stream according to the embedding rule, and the audio and video signals are then reconstructed. Experiments show that the method achieves synchronous audio-video compression coding and transmission without increasing the amount of compressed audio-video data and with only a small loss in audio and video quality.

Key words: information processing, synchronous coding, quarter-pixel motion estimation, audio and video
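The embedding idea in the abstract can be illustrated with a small sketch. The abstract does not state the actual mapping rule between audio bits and search points, so the rule used here is an assumption made only for illustration: the parity of the motion vector components (in quarter-pixel units) carries one bit. The names `bit_of`, `embed_bit`, `extract_bits`, and `toy_cost` are hypothetical, and the cost function is a stand-in for a real encoder's matching cost.

```python
from typing import Callable, List, Tuple

MV = Tuple[int, int]  # motion vector in quarter-pixel units


def bit_of(mv: MV) -> int:
    """Bit carried by a motion vector under the ASSUMED parity rule."""
    return (mv[0] + mv[1]) % 2


def quarter_pel_candidates(center: MV) -> List[MV]:
    """The best point from the coarser search plus its 8 quarter-pixel neighbours."""
    cx, cy = center
    return [(cx + dx, cy + dy) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]


def embed_bit(center: MV, cost: Callable[[MV], float], bit: int) -> MV:
    """Choose the cheapest candidate whose parity equals the audio bit.

    Of the 9 candidates, 5 share the centre's parity and 4 have the
    opposite parity, so the allowed set is never empty.
    """
    allowed = [mv for mv in quarter_pel_candidates(center) if bit_of(mv) == bit]
    return min(allowed, key=cost)


def extract_bits(mvs: List[MV]) -> List[int]:
    """Receiver side: read one bit back from each decoded motion vector."""
    return [bit_of(mv) for mv in mvs]


def toy_cost(target: MV) -> Callable[[MV], float]:
    """Hypothetical matching cost: squared distance to a made-up true motion."""
    def cost(mv: MV) -> float:
        return (mv[0] - target[0]) ** 2 + (mv[1] - target[1]) ** 2
    return cost


audio_bits = [1, 0, 1, 1, 0]
centers = [(4, 8), (-2, 6), (10, -4), (0, 0), (6, 2)]
targets = [(5, 8), (-2, 7), (9, -5), (1, 0), (6, 2)]
coded = [embed_bit(c, toy_cost(t), b)
         for c, t, b in zip(centers, targets, audio_bits)]
assert extract_bits(coded) == audio_bits
```

Because the bit is read from the motion vector that the encoder transmits anyway, a scheme of this kind adds no separate audio payload; the price is that the encoder sometimes has to accept a slightly worse match than the unconstrained optimum.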

CLC number: TN919.81

LI Xiao-ni, CHEN He-xin, CHEN Mian-shu, GABBOUJ Moncef. Audio-video synchronous coding based on motion estimation in H.264[J]. , 2012, 42(05): 1321-1326.
