Research on Blind Super-resolution Reconstruction with Double Discriminator

LU Di, YU Guoliang

Citation: LU Di, YU Guoliang. Research on Blind Super-resolution Reconstruction with Double Discriminator[J]. Journal of Electronics & Information Technology, 2024, 46(1): 277-286. doi: 10.11999/JEIT221502

doi: 10.11999/JEIT221502
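Details
    Author information:

    LU Di: female, professor, Ph.D.; research interests: data fusion and image processing

    YU Guoliang: male, master's student; research interests: image processing and super-resolution reconstruction

    Corresponding author:

    LU Di ludizeng@hrbust.edu.cn

  • CLC number: TN911.73; TP391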

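  • Abstract: Image super-resolution reconstruction has important applications in public security inspection, satellite imaging, medical imaging, and photo restoration. This paper studies super-resolution reconstruction based on generative adversarial networks and proposes a double UNet3+ discriminator method (Double UNet3+ Real-ESRGAN, DU3-Real-ESRGAN) built on Real-ESRGAN, the real-world blind super-resolution algorithm trained with pure synthetic data. First, a UNet3+ structure is introduced into the discriminator to capture fine-grained details and coarse-grained semantics at full scale. Second, a double-discriminator structure is adopted: one discriminator learns image texture details while the other focuses on image edges, so that the two provide complementary image information. On the Set5, Set14, BSD100, and Urban100 datasets, compared with several super-resolution reconstruction methods based on generative adversarial networks, DU3-Real-ESRGAN outperforms the other methods in Peak Signal-to-Noise Ratio (PSNR), Structural SIMilarity (SSIM), and the no-reference Natural Image Quality Evaluator (NIQE) on every dataset except Set5, producing more visually realistic high-resolution images.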
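The double-discriminator idea in the abstract can be illustrated with a minimal NumPy sketch of a generator's adversarial loss that sums the feedback of two discriminators. Several things here are assumptions and are not taken from the paper: feeding the edge branch a Sobel gradient-magnitude map, using binary cross-entropy on per-pixel logits, the weights `w_texture` and `w_edge`, and the function names. The discriminators are passed in as plain callables, so the UNet3+ networks themselves are not modeled.

```python
import numpy as np

def sobel_edges(img):
    """Gradient magnitude of a 2-D grayscale image using 3x3 Sobel kernels."""
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=np.float64)
    ky = kx.T
    img = np.asarray(img, dtype=np.float64)
    h, w = img.shape
    padded = np.pad(img, 1, mode="edge")  # replicate borders
    gx = np.zeros((h, w))
    gy = np.zeros((h, w))
    for i in range(3):
        for j in range(3):
            window = padded[i:i + h, j:j + w]
            gx += kx[i, j] * window
            gy += ky[i, j] * window
    return np.hypot(gx, gy)

def bce_with_logits(logits, target):
    """Mean binary cross-entropy on raw logits (numerically stable form)."""
    z = np.asarray(logits, dtype=np.float64)
    return float(np.mean(np.maximum(z, 0) - z * target + np.log1p(np.exp(-np.abs(z)))))

def generator_adv_loss(d_texture, d_edge, sr, w_texture=1.0, w_edge=1.0):
    """Generator wants both discriminators to label the SR image as real (1).

    d_texture sees the image itself; d_edge sees its Sobel edge map.
    The edge-map input and the weights are assumptions of this sketch.
    """
    loss_texture = bce_with_logits(d_texture(sr), 1.0)
    loss_edge = bce_with_logits(d_edge(sobel_edges(sr)), 1.0)
    return w_texture * loss_texture + w_edge * loss_edge

# Placeholder discriminators that return per-pixel logits of zero (undecided).
undecided = lambda x: np.zeros_like(x, dtype=np.float64)
sr_image = np.random.default_rng(0).random((8, 8))
print(generator_adv_loss(undecided, undecided, sr_image))
```

With both placeholder discriminators undecided, each branch contributes log 2, so the two terms are weighted equally; in a real training loop the callables would be the two trained discriminator networks.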
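  • Figure 1  Real-ESRGAN generator network structure

    Figure 2  Real-ESRGAN discriminator network structure

    Figure 3  UNet++ and UNet3+ network structures

    Figure 4  Decoder structure of the UNet3+ network

    Figure 5  DU3-Real-ESRGAN network structure

    Figure 6  Comparison of HR and LR images from the DIV2K dataset

    Figure 7  Comparison on the Set5 dataset

    Figure 8  Comparison on the BSD100 dataset

    Figure 9  Comparison of different algorithms on the Set14 dataset

    Figure 10  Comparison of different algorithms on the Urban100 dataset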

    Table 1  Comparison of PSNR/SSIM values

    Dataset    SRGAN          EDSR           ESRGAN         Real-ESRGAN    U3-RealESRGAN  DU3-Real-ESRGAN
    Set5       28.99/0.791    28.80/0.787    28.81/0.7868   30.52/0.878    30.01/0.868    30.24/0.870
    Set14      27.03/0.815    26.64/0.803    27.13/0.741    28.71/0.830    28.55/0.845    29.57/0.847
    BSD100     27.85/0.745    28.34/0.827    27.33/0.808    29.14/0.855    29.25/0.851    30.19/0.859
    Urban100   27.45/0.825    27.71/0.7420   27.29/0.836    28.82/0.850    29.15/0.795    30.05/0.857
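    For readers unfamiliar with the PSNR figures in Table 1, the sketch below computes PSNR from its standard definition, 10·log10(MAX² / MSE). It assumes 8-bit images (MAX = 255) compared over all pixels; the page does not say which color channel or border cropping the authors used, so this is an illustration of the metric rather than their evaluation script. The function name `psnr` and the toy images are hypothetical.

```python
import numpy as np

def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two same-shaped images."""
    ref = np.asarray(reference, dtype=np.float64)
    out = np.asarray(restored, dtype=np.float64)
    if ref.shape != out.shape:
        raise ValueError("images must have the same shape")
    mse = np.mean((ref - out) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(max_value ** 2 / mse))

# Toy example: every pixel is off by exactly one gray level, so MSE = 1.
hr = np.full((4, 4), 100, dtype=np.uint8)
sr = hr + 1
print(round(psnr(hr, sr), 2))  # 48.13
```

    Higher PSNR means lower mean squared error against the ground-truth HR image; SSIM, the second number in each table cell, is likewise higher-is-better.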

    Table 2  Comparison of NIQE values

    Dataset    SRGAN      EDSR       ESRGAN     Real-ESRGAN  U3-RealESRGAN  DU3-Real-ESRGAN
    Set5       5.6712     5.1372     4.5806     3.5064       3.6021         3.8400
    Set14      7.5593     5.1588     4.4096     3.5413       3.5332         3.5168
    BSD100     7.3413     6.2715     3.8172     3.6916       3.2675         3.2474
    Urban100   7.1089     6.5632     4.1996     3.9290       3.4543         3.3993
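    The abstract's statement that DU3-Real-ESRGAN leads on every dataset except Set5 can be checked directly against the values in Tables 1 and 2. The short script below does that, treating PSNR and SSIM as higher-is-better and NIQE as lower-is-better, and reading each NIQE entry as a four-decimal value. The helper names `best_method` and `winners` are illustrative only.

```python
# Values copied from Table 1 (PSNR/SSIM) and Table 2 (NIQE) of this page.
METHODS = ["SRGAN", "EDSR", "ESRGAN", "Real-ESRGAN", "U3-RealESRGAN", "DU3-Real-ESRGAN"]

PSNR = {
    "Set5": [28.99, 28.80, 28.81, 30.52, 30.01, 30.24],
    "Set14": [27.03, 26.64, 27.13, 28.71, 28.55, 29.57],
    "BSD100": [27.85, 28.34, 27.33, 29.14, 29.25, 30.19],
    "Urban100": [27.45, 27.71, 27.29, 28.82, 29.15, 30.05],
}
SSIM = {
    "Set5": [0.791, 0.787, 0.7868, 0.878, 0.868, 0.870],
    "Set14": [0.815, 0.803, 0.741, 0.830, 0.845, 0.847],
    "BSD100": [0.745, 0.827, 0.808, 0.855, 0.851, 0.859],
    "Urban100": [0.825, 0.7420, 0.836, 0.850, 0.795, 0.857],
}
NIQE = {
    "Set5": [5.6712, 5.1372, 4.5806, 3.5064, 3.6021, 3.8400],
    "Set14": [7.5593, 5.1588, 4.4096, 3.5413, 3.5332, 3.5168],
    "BSD100": [7.3413, 6.2715, 3.8172, 3.6916, 3.2675, 3.2474],
    "Urban100": [7.1089, 6.5632, 4.1996, 3.9290, 3.4543, 3.3993],
}

def best_method(scores, higher_is_better):
    """Name of the method with the best score in one table row."""
    pick = max if higher_is_better else min
    return METHODS[scores.index(pick(scores))]

def winners():
    """Best method per dataset and metric."""
    return {
        dataset: {
            "PSNR": best_method(PSNR[dataset], True),
            "SSIM": best_method(SSIM[dataset], True),
            "NIQE": best_method(NIQE[dataset], False),
        }
        for dataset in PSNR
    }

for dataset, row in winners().items():
    print(dataset, row)
```

    On Set5 the best value for all three metrics belongs to Real-ESRGAN; on Set14, BSD100, and Urban100 it belongs to DU3-Real-ESRGAN, which matches the abstract.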
Publication history
  • Received: 2022-12-02
  • Revised: 2023-09-13
  • Published online: 2023-09-15
  • Issue date: 2024-01-17
