高级搜索

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于AVS2的色度扩展视频编码的设计和实现

王淑慧

王淑慧. 基于AVS2的色度扩展视频编码的设计和实现[J]. 电子与信息学报, 2018, 40(12): 2936-2944. doi: 10.11999/JEIT180154
引用本文: 王淑慧. 基于AVS2的色度扩展视频编码的设计和实现[J]. 电子与信息学报, 2018, 40(12): 2936-2944. doi: 10.11999/JEIT180154
Shuhui WANG. Design and Implementation for Chroma Extensions Video Coding Based on AVS2 Platform[J]. Journal of Electronics & Information Technology, 2018, 40(12): 2936-2944. doi: 10.11999/JEIT180154
Citation: Shuhui WANG. Design and Implementation for Chroma Extensions Video Coding Based on AVS2 Platform[J]. Journal of Electronics & Information Technology, 2018, 40(12): 2936-2944. doi: 10.11999/JEIT180154

基于AVS2的色度扩展视频编码的设计和实现

doi: 10.11999/JEIT180154
基金项目: 国家自然科学基金(61601200, 61871289),上海市自然科学基金(18ZR1440600)
详细信息
    作者简介:

    王淑慧:女,1973年生,副研究员,研究方向为视频编码算法及SOC实现

    通讯作者:

    王淑慧  shw@tongji.edu.cn

  • 中图分类号: TN919.8

Design and Implementation for Chroma Extensions Video Coding Based on AVS2 Platform

Funds: The National Natural Science Foundation of China (61601200, 61871289), Shanghai Natural Science Foundation (18ZR1440600)
  • 摘要: 色度扩展视频编码是当前视频编码领域的一个热点研究课题。该文提出基于AVS2平台的色度扩展视频帧内编码的实现方案。仿444/422编码方案通过将输入图像中的色度分量下采样后,使用原有的420方式进行编码,以实现444/422编码。进一步,该文将帧内预测及环路滤波等编码模块无缝扩展到相应的444/422格式,实现444/422帧内预测编码。实验结果表明,对444格式和422格式序列,在高码率的情况下444/422帧内预测编码与仿444/422编码相比,U/V平均BD-rate的减少分别为31.44%/31.72%和18.85%/19.3%,而Y分量平均BD-rate的增加仅为0.5%。其中422色度帧内预测过程的算法优化减少Y/U/V BD-rate最高可达5.66%。与HEVC RExt编码相比,在低码率时,444/422帧内预测编码取得了更好或相近的编码性能。
  • 图  1  HEVC RExt中的全444/422编码架构

    图  2  P444/422编码架构

    图  3  IP444/422编码架构

    图  4  不同色度格式对应的帧内预测块划分方式

    图  5  Bilinear模式预测

    图  6  角度模式示意图

    表  1  8个测试序列

    序号 序列名 短名 分辨率 比特数/样点 帧率
    1 iceAerial_3840×2160_444(422)_10 bit ARL 3840×2160 10 30
    2 iceRiver_3840×2160_444(422)_10 bit RVR 3840×2160 10 30
    3 iceRock_3840×2160_444(422)_10 bit RCK 3840×2160 10 30
    4 iceRock2_3840×2160_444(422)_10 bit RK2 3840×2160 10 30
    5 Bubbles_4096×2160_24_10bit_444(422) BBE10b 4096×2160 10 24
    6 Traffic_2560×1600_30_10 bit_444(422)_crop TFC10b 2560×1600 10 30
    7 Bubbles_4096×2160_24_12 bit_444(422) BBE12b 4096×2160 12 24
    8 Traffic_2560×1600_30_12 bit_444(422)_crop TFC12b 2560×1600 12 30
    下载: 导出CSV

    表  2  两组测试QP

    传统QP组 超高质量QP组
    P444/422及IP444/422编码器,
    10-bit序列
    41, 48, 54, 61 14, 21, 28, 34
    P444/422及IP444/422编码器,
    12-bit序列
    57, 64, 70, 77 30, 37, 44, 50
    HM16.6编码器 22, 27, 32, 37 2, 7, 12, 17
    下载: 导出CSV

    表  3  IP444/422与P444/422实验结果比较(%)

    序列名 QP=14, 21, 28, 34 or 30, 37, 44, 50 QP=41, 48, 54, 61 or 57, 64, 70, 77
    Y U V Y U V
    YUV444 ARL –0.59 –71.07 –47.01 0.08 –4.42 –2.31
    RVR –0.21 –25.13 –39.49 0.43 4.49 1.36
    RCK 0.18 –16.67 –14.77 0.43 7.47 7.88
    RK2 0.21 –27.59 –18.88 0.43 2.29 0.51
    BBE10b 0.17 –48.05 –38.37 0.13 –22.79 –15.33
    TFC10b 1.85 –27.33 –55.52 0.53 –6.05 –9.89
    BBE12b 0.24 –28.43 –18.69 –0.14 –15.12 –8.67
    TFC12b 2.12 –7.27 –21.05 0.12 –3.76 –5.52
    444平均 0.50 –31.44 –31.72 0.25 –4.74 –4.00
    YUV422 ARL 0.08 –32.45 –21.12 5.45 –3.48 –1.82
    RVR 0.58 –18.22 –29.51 3.58 4.79 1.99
    RCK 0.65 –11.52 –9.44 3.94 8.96 8.74
    RK2 0.71 –21.95 –13.92 3.89 1.92 1.75
    BBE10b 0.06 –19.36 –18.34 1.27 –2.63 –0.68
    TFC10b 0.81 –21.35 –38.49 3.46 –2.43 –5.66
    BBE12b 0.43 –8.77 –6.16 0.18 –2.98 –1.37
    TFC12b 0.90 –17.14 –17.42 0.30 –1.69 –2.81
    422平均 平均 0.53 –18.85 –19.30 2.76 0.31 0.02
    下载: 导出CSV

    表  4  IP444/422编码与HM16.6实验结果比较(%)

    序列名 QP=14, 21, 28, 34 or 30, 37, 44, 50 QP=41, 48, 54, 61 or 57, 64, 70, 77
    Y U V Cpsnr Y U V Cpsnr
    YUV444 ARL –8.91 71.73 65.46 –2.32 –13.07 64.85 58.25 –5.98
    RVR –8.66 87.33 117.26 –1.61 –6.25 100.29 97.44 –0.39
    RCK –6.27 65.96 68.14 –3.26 –9.45 41.87 40.87 –6.92
    RK2 –6.60 78.85 82.33 –3.04 –8.97 41.30 36.70 –6.84
    BBE10b –21.38 144.38 100.00 1.08 –34.36 99.20 101.21 –14.43
    TFC10b –33.59 100.00 100.00 71.28 –20.12 83.45 68.89 –8.13
    BBE12b –20.58 100.00 100.00 12.25 –25.99 138.27 125.42 –2.67
    TFC12b –32.97 100.00 100.00 66.38 –12.85 84.05 81.61 –1.69
    444平均 –17.37 93.53 91.65 17.60 –16.38 81.66 76.30 –5.88
    YUV422 ARL –5.32 11.11 12.37 –3.01 –6.08 7.18 5.96 –4.22
    RVR –5.67 26.32 37.23 –1.82 –2.27 13.82 17.62 –0.31
    RCK –3.18 25.05 24.41 –1.45 –4.34 3.84 2.76 –3.69
    RK2 –3.10 29.96 29.46 –1.01 –3.66 1.06 –4.21 –3.51
    BBE10b –11.98 23.52 23.79 –3.75 –16.58 18.07 25.56 –9.42
    TFC10b –16.17 100.00 77.21 23.00 –6.84 18.05 14.23 –2.53
    BBE12b –10.98 49.59 54.40 2.84 –7.30 31.64 36.23 0.66
    TFC12b –15.42 100.00 126.25 22.01 –2.16 20.82 21.55 1.66
    422平均 平均 –8.98 45.69 48.14 4.60 –6.15 14.31 14.96 –2.67
    下载: 导出CSV

    表  5  422格式色度帧内预测修改的实验结果(%)

    序列名 QP=14, 21, 28, 34 or 30, 37, 44, 50 QP=41, 48, 54, 61 or 57, 64, 70, 77
    Y U V Y U V
    ARL –1.68 –1.61 –1.73% –3.24 –3.02 –2.73
    RVR –1.96 –1.01 –1.91 –2.91 –2.41 –2.48
    RCK –1.57 –0.17 0.08 –2.82 –5.30 –5.66
    RK2 –1.43 –1.22 0.09 –2.40 –2.50 –2.74
    BBE10b –0.86 –0.82 –0.70 –2.48 –3.53 –3.37
    TFC10b –1.05 –2.60 –1.25 –1.71 –2.60 –1.22
    BBE12b –0.80 –2.54 –2.19 –5.44 –2.13 –0.77
    TFC12b –1.14 –3.70 –4.47 –2.90 –2.20 –1.58
    平均 –1.31 –1.71 –1.51 –2.99 –2.96 –2.57
    下载: 导出CSV
  • FRANÇOIS E, RUSANOVSKYY D, YIN P, et al. Suggested new draft text of signalling, backward compatibility and display adaptation for HDR/WCG video coding[C]. JCTVC-Z0023, Geneva, Switzerland, 2017: 1–31.
    GOMMELET D, ROUMY A, GUILLEMOT C, et al. Gradient-based tone mapping for rate-distortion optimized backward-compatible high dynamic range compression[J]. IEEE Transactions on Image Processing, 2017, 26(12): 5936–5949 doi: 10.1109/TIP.2017.2740159
    HENRY F, JUNG J, OUACH A, et al. Stereoscopic 360 video compression with the next generation video codec[C]. JVET-G0064, Torino, Italy, 2017: 1–5.
    FLYNN D, MARPE D, NACCARI M, et al. Overview of the range extensions for the HEVC standard: Tools, profiles, and performance[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2016, 26(1): 4–19 doi: 10.1109/TCSVT.2015.2478707
    PENG W H, WALLS F, COHEN R A, et al. Overview of screen content video coding: Technologies, standards, and beyond[J]. IEEE Journal on Emerging Selected Topics in Circuits and Systems, 2016, 6(4): 393–408 doi: 10.1109/JETCAS.2016.2608971
    林涛, 蔡文婷, 陈先义, 等. 一种高性能低复杂度的基于串匹配的屏幕图像无损压缩算法[J]. 电子与信息学报, 2017, 39(2): 351–359 doi: 10.11999/JEIT160560

    LIN Tao, CAI Wenting, CHEN Xianyi, et al. Lossless compression algorithm based on string matching with high performance and low complexity for screen content coding[J]. Journal of Electronics&Information Technology, 2017, 39(2): 351–359 doi: 10.11999/JEIT160560
    ZHAO Liping, ZHOU Kailun, GUO Jing, et al. A universal string matching approach to screen content coding[J]. IEEE Transactions on Multimedia, 2018, 20(4): 796–809 doi: 10.1109/TMM.2017.2758519
    ZHU Weijia, ZHANG Kai, AN Jicheng, et al. Inter-palette coding in screen content coding[J]. IEEE Transactions on Broadcasting, 2017, 63(4): 673–679 doi: 10.1109/TBC.2017.2711144
    FLYNN D, NACCARI M, ROSEWARNE C, et al. High Efficiency Video Coding (HEVC) range extensions text specification: Draft 7[C], JCTVC-Q1005, Valencia, Spain, 2014.
    WIEN M, BARONCINI V, BOYCE J, et al. Preliminary joint call for evidence on video compression with capability beyond HEVC[C], JVET-E1002, Geneva, Switzerland, 2017: 1–9.
    AVS N2378. AVS第59次会议视频组会议纪要[C]. 海口, 中国, 2016.

    AVS N2378. Meeting summary of video coding subgroup of the 59th AVS meeting[C]. Haikou, China, 2016.
    王淑慧, 林涛, 郭靖, 等. AVS2-P2框架下对YUV444图像的编码[C]. AVS M4082, 海口, 中国, 2016: 1–7.

    WANG Shuhui, LIN Tao, Guo Jing, et al. YUV444 image coding in AVS2-P2[C]. AVS M4082, Haikou, China, 2016: 1–7.
    王淑慧, 林涛, 郭靖, 等. AVS2-P2框架下对YUV444和YUV422(简称CExt)图像的编码[C]. AVS M4154, 北京, 中国, 2017: 1–10.

    WANG Shuhui, LIN Tao, Guo Jing, et al. YUV444 and YUV422 image coding in AVS2-P2[C]. AVS M4154, Beijing, China, 2017: 1–10.
    郑萧桢. 无人机航拍视频序列意见征集[C]. AVS M4503, 大连, 中国, 2016: 1.

    ZHENG Xiaozhen. Opinion collection of aerial video[C]. AVS M4503, Dalian, China, 2016: 1.
    ROSEWARNE C, SHARMAN K, and FLYNN D. Common test conditions and software reference configurations for HEVC range extensions[C]. JCTVC-P1006, San Jose, USA, 2014: 1–11.
    BJØNTEGAARD G. Calculation of average PSNR differences between RD-curves[C]. VCEG-M33, Austin, USA, 2001: 1–4.
    BJØNTEGAARD G. Improvements of the BD-PSNR model. ITU-T SG16 Q.6 document[C]. VCEG-AI11, Berlin, Germany, 2008: 1–2.
    吴成家, 陈大鹏, 陈芳, 等. 一种计算综合PSNR的方法[C]. AVS M3830, 杭州, 中国, 2016: 1–11.

    WU Chengjia, CHEN Dapeng, CHEN Fang, et al. A method for calculating integrated PSNR[C]. AVS M3830, Hangzhou, China, 2016: 1–11.
  • 加载中
图(6) / 表(5)
计量
  • 文章访问数:  1895
  • HTML全文浏览量:  723
  • PDF下载量:  32
  • 被引次数: 0
出版历程
  • 收稿日期:  2018-02-06
  • 修回日期:  2018-09-14
  • 网络出版日期:  2018-09-18
  • 刊出日期:  2018-12-01

目录

    /

    返回文章
    返回