低秩大模板二维卷积算法的脉动阵列设计
SYSTOLIC ARRAY DESIGN FOR 2-D CONVOLUTIONS WITH BIG KERNEL IN LOW-RANK
-
摘要: 本文针对低秩大模板二维卷积运算的特点,给出了其快速算法,并利用基于相关图的三步骤映射法设计了其脉动阵列实现结构。该结构并行效率高,并可达到线性加速比。
-
关键词:
- 低秩大模板二维卷积; 映射; 脉动阵列
Abstract: The characteristics of 2-D convolutions with big kernel in low-rank are analysed, and a fast algorithm is given. Then a systolic array implementation, which is derived by a three-stage dependence-graph-based mapping approach, is presented. It is shown that the architecture has a high efficiency for parallel processing and a nearly linear speed-up. -
Kung H T, Lam M S. J. Parallel and Distributed Computting, 1984, 1(1): 32-63.[2]De Vos L, Stegherr M. A Family of Application-Specific VLSI Architecture for the Block-Matching[3]Algorithm. in Systolic Array Processors, J.McCanny, J.Mcwhirter, E.Swartzlander, ed., Hertford-shire: Prentice-Hall, Inc., 1989, 421-430.[4]Bombardieri J. IEEE Trans. on Signal Processing, 1992, SP-40(5): 1253-1257.[5]Kung S Y. VLSI Array Processors, Englewood Cliffs: Prentice-Hall, Inc., 1988, 119-211.
计量
- 文章访问数: 2168
- HTML全文浏览量: 157
- PDF下载量: 482
- 被引次数: 0