Design and Hardware Implementation of Image Recognition System Based on Improved Neural Network

Dong WEI^{1, 2}, Bochen DONG¹, Yiqing LIU³

doi: 10.11999/JEIT200202 cstr: 32379.14.JEIT200202

1. School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, China
2. Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing Municipal Science and Technology Commission, Beijing 100044, China
3. Beijing Yading Intelligent Technology Limited Company, Beijing 100071, China

Funds: The High Level Innovation Team Construction Project of Beijing Municipal Universities (IDHT20190506), The National Natural Science Foundation of China (61871020), The Key Science and Technology Plan Project of Beijing Municipal Education Commission of China (KZ201810016019)

Details

About the authors:
Dong WEI: female, born in 1968, professor; research interest: optimization computation with artificial neural networks

Bochen DONG: male, born in 1995, master's degree; research interest: neural network chip design

Yiqing LIU: male, born in 1987, master's degree; research interest: control science and engineering

Corresponding author:
Dong WEI　weidong@bucea.edu.cn

CLC number: TN911.73; TP183
Metrics
- Article views: 1369
- HTML full-text views: 700
- PDF downloads: 177
- Citations: 0
Publication history
- Received: 2020-03-24
- Revised: 2020-09-23
- Published online: 2020-12-09
- Issue date: 2021-07-10

Abstract

Abstract: Most existing image recognition systems are implemented in software and therefore cannot exploit the parallel computing capability of neural networks. To address this, the paper proposes a hardware image recognition system based on an improved RBF neural network and built around an FPGA. Replacing the multiplication operations with additions removes the computational complexity that makes neural networks inconvenient to implement in hardware, and a sorting circuit based on bit comparison is proposed to sort large amounts of data quickly. On this basis, a multi-target image recognition application system is developed. The feature extraction part of the system is implemented on an FPGA, and the image recognition part is implemented as an ASIC circuit. The experimental results show that the average recognition time of the proposed improved RBF neural network algorithm is 50% shorter than that of LeNet-5, AlexNet and VGG16, and that the developed hardware system recognizes 10000 sample pictures in 165 μs, about 60% less than the 426.6 μs required by a DSP chip system.

Keywords: FPGA / ASIC circuit / RBF neural networks / Image recognition system
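As a rough illustration of what replacing multiplications with additions can look like, the sketch below contrasts the squared Euclidean distance often used in RBF units with a Manhattan (L1) distance that needs only subtraction, absolute value and addition. The choice of the L1 distance, the function names and the `(label, vector)` prototype layout are assumptions made for illustration; the abstract does not say which substitution the authors actually use.

```python
def l1_distance(x, center):
    """Sum of absolute differences: only subtract, abs and add are needed."""
    return sum(abs(a - b) for a, b in zip(x, center))


def squared_l2_distance(x, center):
    """Conventional squared Euclidean distance: one multiply per element."""
    return sum((a - b) * (a - b) for a, b in zip(x, center))


def classify(x, prototypes):
    """Return the label of the stored prototype closest to x in L1 distance.

    prototypes is a list of (label, vector) pairs (hypothetical layout).
    """
    best_label, _ = min(prototypes, key=lambda p: l1_distance(x, p[1]))
    return best_label


if __name__ == "__main__":
    # Two made-up 256-byte prototypes and one made-up sample.
    prototypes = [("dark", [0] * 256), ("bright", [200] * 256)]
    sample = [190] * 256
    print(classify(sample, prototypes))
```

Because every element contributes one independent subtraction and one addition, a distance of this kind maps naturally onto a wide array of small adders, which is the sort of parallelism the abstract says software implementations cannot exploit.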

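Figure 1  Architecture of the image recognition system based on the improved RBF neural network circuit

Figure 2  Learning flow of the multi-target recognition system

Figure 3  Improved RBF neural network model

Figure 4  Learning model of the improved RBF neural network

Figure 5  Fast sorting circuit

Figure 6  State machine of the system algorithm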
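Figure 5 refers to a fast sorting circuit, which the abstract describes as based on bit comparison. One common way to sort by bit comparison is a bit-serial search: scan the values from the most significant bit to the least significant bit, and at each bit position keep only the candidates that have a 0 there, if any do. The software sketch below follows that idea; whether the authors' circuit works this way is an assumption, and the function names and the default 8-bit width are illustrative.

```python
def bitwise_argmin(values, width=8):
    """Index of the smallest value, found by scanning bits from MSB to LSB.

    values must be a non-empty list of non-negative integers below 2**width.
    """
    candidates = list(range(len(values)))
    for bit in reversed(range(width)):
        # Candidates with a 0 in this bit are smaller than those with a 1.
        zeros = [i for i in candidates if not (values[i] >> bit) & 1]
        if zeros:
            candidates = zeros
    return candidates[0]


def bitwise_sort(values, width=8):
    """Ascending sort by repeatedly extracting the bitwise minimum."""
    remaining = list(values)
    ordered = []
    while remaining:
        ordered.append(remaining.pop(bitwise_argmin(remaining, width)))
    return ordered
```

In software each bit step loops over the candidates, but in hardware all candidates can be examined at once, so finding one minimum takes a number of steps that depends on the bit width rather than on how many values are stored.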

Table 1  Comparison of different network models on the MNIST test dataset

Network model	Accuracy	Average recognition time (s)
LeNet-5	0.989	1.2
AlexNet	0.991	1.5
VGG16	0.997	2.3
Improved RBF neural network	0.996	0.9

Table 2  Comparison of different network models on the CIFAR-10 test dataset

Network model	Accuracy	Average recognition time (s)
LeNet-5	0.787	2.4
AlexNet	0.810	3.3
VGG16	0.832	5.5
Improved RBF neural network	0.828	1.3

Table 3  Comparison of different network models on the VOC2012 test dataset

Network model	Accuracy	Average recognition time (s)
LeNet-5	0.757	2.7
AlexNet	0.783	3.5
VGG16	0.813	6.7
Improved RBF neural network	0.808	2.6
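The recognition times in Tables 1 to 3 can be turned into relative reductions with a few lines of arithmetic. The sketch below computes, for each dataset and each baseline, the fraction by which the improved RBF network shortens the average recognition time. The dictionary layout and function name are illustrative, and the unweighted mean over all nine pairs is only one possible way to average; the page does not state how the 50% figure in the abstract was obtained.

```python
# Average recognition times in seconds, copied from Tables 1 to 3.
TIMES = {
    "MNIST": {"LeNet-5": 1.2, "AlexNet": 1.5, "VGG16": 2.3, "Improved RBF": 0.9},
    "CIFAR-10": {"LeNet-5": 2.4, "AlexNet": 3.3, "VGG16": 5.5, "Improved RBF": 1.3},
    "VOC2012": {"LeNet-5": 2.7, "AlexNet": 3.5, "VGG16": 6.7, "Improved RBF": 2.6},
}


def time_reductions(times, ours="Improved RBF"):
    """Map (dataset, baseline) to the fractional reduction in recognition time."""
    result = {}
    for dataset, row in times.items():
        for model, seconds in row.items():
            if model != ours:
                result[(dataset, model)] = 1.0 - row[ours] / seconds
    return result


if __name__ == "__main__":
    reductions = time_reductions(TIMES)
    for key, value in reductions.items():
        print(key, f"{value:.1%}")
    print("unweighted mean:", f"{sum(reductions.values()) / len(reductions):.1%}")
```

With the table values, the individual reductions range from about 4% (LeNet-5 on VOC2012) to about 76% (VGG16 on CIFAR-10), and their unweighted mean is about 44%.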

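Table 4  Performance comparison between the DSP-based image recognition system and the proposed image recognition system

Item	DSP chip	Proposed system
Clock frequency (MHz)	500	15
Number of ALUs	6	1024
Operand width (bit)	32	8
Size of one sample (Byte)	256	256
Additions per cycle	6	1024×2=2048
Data processed per cycle (Byte)	(32/8)×6=24	(8/8)×2048=2048
Cycles to compare two samples	(2×256/24)=21.33	(2×256/2048)=0.25
Time to compare two samples (ns)	21.33×2=42.66	0.25×66=16.5
Time to compare against all samples (μs)	426.6	165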
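The arithmetic behind Table 4 can be reproduced with the short sketch below. It derives the bytes processed per cycle from the ALU count and operand width, the cycles needed to stream two 256-byte samples, and the resulting time at the given clock frequency. The function and parameter names are illustrative, and reading the table's "1024×2" as two additions per ALU per cycle is an assumption.

```python
def compare_time_ns(clock_mhz, alus, adds_per_alu, width_bits, sample_bytes=256):
    """Time in nanoseconds to compare two samples, following Table 4's steps."""
    bytes_per_cycle = alus * adds_per_alu * width_bits / 8
    cycles = 2 * sample_bytes / bytes_per_cycle  # both samples are streamed
    period_ns = 1000.0 / clock_mhz
    return cycles * period_ns


def total_time_us(per_pair_ns, num_samples=10000):
    """Time in microseconds to compare against num_samples stored samples."""
    return per_pair_ns * num_samples / 1000.0


if __name__ == "__main__":
    dsp = compare_time_ns(clock_mhz=500, alus=6, adds_per_alu=1, width_bits=32)
    proposed = compare_time_ns(clock_mhz=15, alus=1024, adds_per_alu=2, width_bits=8)
    print(dsp, total_time_us(dsp))
    print(proposed, total_time_us(proposed))
```

Using the exact 15 MHz clock period (about 66.67 ns) gives roughly 16.67 ns per comparison and 166.7 μs in total for the proposed system; the table's 16.5 ns and 165 μs follow from truncating the period to 66 ns. Either way the reduction relative to the DSP figure is about 60%.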

