吉林大学学报(工学版) ›› 2025, Vol. 55 ›› Issue (3): 993-1000.doi: 10.13229/j.cnki.jdxbgxb.20240173

• 计算机科学与技术 • 上一篇    下一篇

基于块方向预测的AV1快速帧内编码算法

严利民(),金炜烨   

  1. 上海大学 微电子研究与开发中心,上海 200444
  • 收稿日期:2024-02-21 出版日期:2025-03-01 发布日期:2025-05-20
  • 作者简介:严利民(1971-),男,副教授,博士.研究方向:图像处理,计算机视觉,信息安全,集成电路设计与应用.E-mail:yanlm@shu.edu.cn
  • 基金资助:
    国家自然科学基金项目(61774101)

Fast algorithm based on block direction prediction for AV1 intra encoding

Li-min YAN(),Wei-ye JIN   

  1. Microelectronics Research and Development Center,Shanghai University,Shanghai 200444,China
  • Received:2024-02-21 Online:2025-03-01 Published:2025-05-20

摘要:

AV1压缩技术增加了编码复杂性,为了满足实时通信(RTC)应用要求,在低端平台上,编码器要以低延迟、低比特率、低编码时间运行,这使得在RTC任务中,使用AV1编码器具有挑战性。对此,本文提出了一种AV1快速帧内编码算法。通过预先计算块的方向特性,在帧内编码方向预测遍历期间,跳过与预先计算的方向显著不同的方向,从而提高libaom-AV1编码器的编码速度,节省帧内编码时间,并加速AV1在低延迟视频编码应用中的进展。实验结果表明:该算法可节省25.89%的运行时间,且编码时间节省后,编码性能损失较小。

关键词: AV1, libaom, 实时编码, 帧内编码速度优化

Abstract:

AV1 compression technology increases encoding complexity. In order to meet the requirements of real-time communication (RTC) applications, encoders need to run at low latency, low bit rate, and low encoding time on low-end platforms, which makes using AV1 encoders challenging in RTC tasks. In this regard, this article proposes an AV1 fast intra frame encoding algorithm. By pre calculating the directional characteristics of blocks, during the intra frame encoding direction prediction traversal, significantly different directions from the pre calculated direction are skipped, thereby improving the encoding speed of the libaom-AV1 encoder, saving intra frame encoding time, and accelerating the progress of AV1 in low latency video encoding applications. The experimental results show that the algorithm can save 25.89% of running time, and after saving encoding time, the loss of encoding performance is relatively small.

Key words: AV1, libaom, real-time encoding, intra encoding speed optimization

中图分类号: 

  • TP312

图1

AV1视频编码框架"

图2

AV1中方向性帧内预测模式示意图"

图3

BasketballDrill预测模式分布图"

图4

BasketballDrill左下角放大后像素图"

图5

16×16的块放大至最大示意图"

图6

16×16的块像素值图"

图7

8×8块中沿方向d=0∶7的像素的行号k"

图8

算法流程图"

表1

快速帧内预测算法优化情况"

分辨率序列名称原编码时间/s优化后编码耗时/sTS/%BD-rate损失/%
416×240BasketballPass22.4915.1532.640.47
BlowingBubbles29.7420.3431.610.33
BQSquare23.2417.2825.650.37
RaceHorses123.4816.6828.960.40
832×480BasketballDrillText87.7367.2823.310.18
BasketballDrill79.8961.3323.230.61
BQMall90.6163.1530.310.40
PartyScene96.3377.2419.820.36
RaceHorses283.4062.6924.830.34
1 024×768ChinaSpeed86.9670.1719.310.17
1 280×720SlideEditing105.8787.8417.030.06
SlideShow50.8940.3820.650.32
1 920×1 080BasketballDrive350.95268.0223.630.24
BQTerrace368.42303.5617.600.31
Cactus524.32370.9229.260.25
Kimono1383.74257.1033.000.11
ParkScene508.43354.9030.200.14
vidyo4120.1887.7027.030.38
2 560×1 600PeopleOnStreet723.64509.9029.540.30
Traffic805.04560.6230.360.25

表2

不同方法的比较结果"

算法TS/%BD-rate损失/%
文献[415.860.44
文献[58.670.04
本文25.890.29
1 Akyazi P, Ebrahimi T. Comparison of compression efficiency between HEVC/H.265,VP9 and AV1 based on subjective quality assessments[J/OL]. [2024-01-28].
2 Wiegand T, Sullivan G J, Bjontegaard G, et al. Overview of the H.264/AVC video coding standard[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2003, 13(7): 560-576.
3 Han J, Li B, Mukherjee D, et al. A technical overview of AV1[J]. Proceedings of the IEEE, 2021, 109(9): 1435-1462.
4 Jeong J, Gankhuyag G, Kim Y H. Fast chroma prediction mode decision based on luma prediction mode for AV1 intra coding[C]∥International Conference on Information and Communication Technology Convergence (ICTC),Jeju, Korea (South),2019: 1050-1052.
5 Corrêa M, Zatt B, Palomino D, et al. A fast local mode decision for the HEVC intra prediction based on direction detection[J/OL]. [2024-01-28].
6 Xu M, Jeon B. Selection of intra prediction tools for fast AV1 encoding[C]∥IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB),Paris, France, 2020: No.9379847.
7 Zhao X, Zhao L, Krishnan M, et al. Study on coding tools beyond AV1[C]∥IEEE International Conference on Multimedia and Expo (ICME),Shenzhen, China, 2021: No.9428244.
8 Bross B, Wang Y K, Ye Y, et al. Overview of the versatile video coding (VVC) standard and its applications[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2021, 31(10): 3736-3764.
9 Chen Y, Murherjee D, Han J, et al. An overview of core coding tools in the AV1 video codec[C]∥2018 Picture Coding Symposium (PCS),San Francisco, USA, 2018: 41-45.
10 Zhao L, Zhao X, Liu S. Improved intra coding beyond AV1 using adaptive prediction angles and reference lines[C]∥IEEE International Conference on Image Processing (ICIP),Abu Dhabi, United Arab Emirates,2020: 3408-3412.
11 Li J, Li B, Xu J, et al. Efficient multiple-line-based intra prediction for HEVC[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2016, 28(4): 947-957.
12 Bjontegaard G. Calculation of average PSNR differences between RD-curves[R].Geneva: ITU SG16 Doc. VCEG-M33, 2001.
[1] 刘义艳,刘兴,刘方方,代杰. 基于改进ELM-Markov Model的建筑结构稳定性监测算法[J]. 吉林大学学报(工学版), 2024, 54(11): 3351-3357.
[2] 刘洲洲,孙传新,王晓柱,张杨梅. 采用VGG19和低通滤波的红外与微光图像融合方法[J]. 吉林大学学报(工学版), 2023, 53(1): 255-262.
[3] 王晓宇, 欧阳丹彤, 赵剑, 耿雪娜. 基于冲突的离散事件系统诊断方法[J]. 吉林大学学报(工学版), 2013, 43(02): 380-385.
[4] 郭振华, 吴艳霞, 张国印, 杨杰, 顾国昌. 面向C2VHDL编译器的基本块级指针分析算法[J]. 吉林大学学报(工学版), 2013, 43(02): 417-423.
[5] 方美玉, 郑小林, 陈德人, 华艺, 施艳. 商品评论聚焦爬虫算法设计与实现[J]. 吉林大学学报(工学版), 2012, 42(增刊1): 377-381.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!