Deep Convolutional Neural Network Architecture With Reconfigurable Computation Patterns

Authors: Tu, Fengbin; Yin, Shouyi*; Ouyang, Peng; Tang, Shibin; Liu, Leibo; Wei, Shaojun
Source: IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2017, 25(8): 2220-2233.
DOI: 10.1109/TVLSI.2017.2688340

Abstract

Deep convolutional neural networks (DCNNs) have been successfully used in many computer vision tasks. Previous work on DCNN acceleration usually applies a fixed computation pattern to diverse DCNN models, leading to an imbalance between power efficiency and performance. We solve this problem by designing a DCNN acceleration architecture called deep neural architecture (DNA), with reconfigurable computation patterns for different models. The computation pattern comprises a data reuse pattern and a convolution mapping method. For massive and varying layer sizes, DNA reconfigures its data paths to support a hybrid data reuse pattern, which reduces total energy consumption by 5.9 to 8.4 times over conventional methods. For various convolution parameters, DNA reconfigures its computing resources to support a highly scalable convolution mapping method, which achieves 93% computing resource utilization on modern DCNNs. Finally, a layer-based scheduling framework is proposed to balance DNA's power efficiency and performance for different DCNNs. DNA is implemented in an area of 16 mm² at 65 nm. On the benchmarks, it achieves 194.4 GOPS at 200 MHz and consumes only 479 mW. The system-level power efficiency is 152.9 GOPS/W (including DRAM access power), which outperforms state-of-the-art designs by one to two orders of magnitude.
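
To make the idea of per-layer reconfiguration concrete, the following is a minimal conceptual sketch in Python of a layer-based scheduler that picks a data reuse pattern (input, output, or weight reuse) for each convolution layer by comparing estimated off-chip traffic. The cost model, multipliers, layer shapes, and function names are illustrative assumptions for this sketch, not DNA's actual scheduling framework or energy model from the paper.

```python
# Conceptual sketch only: a toy layer-based scheduler that, for each convolution
# layer, picks the data reuse pattern with the lowest estimated off-chip traffic.
# The cost model, multipliers, and example layers are illustrative assumptions,
# not DNA's actual scheduling framework or energy model.

from dataclasses import dataclass


@dataclass
class ConvLayer:
    name: str
    in_ch: int      # input channels
    out_ch: int     # output channels
    in_size: int    # input feature map height/width (square maps assumed)
    kernel: int     # kernel height/width
    stride: int = 1

    @property
    def out_size(self) -> int:
        # Output spatial size without padding (assumption for this sketch).
        return (self.in_size - self.kernel) // self.stride + 1


def dram_traffic(layer: ConvLayer, pattern: str) -> int:
    """Very rough DRAM-access estimate (in words) for one layer under a given
    reuse pattern: each pattern keeps one operand on chip and streams the
    others more often. The streaming factors are placeholders, not measured."""
    ifmap = layer.in_ch * layer.in_size ** 2
    ofmap = layer.out_ch * layer.out_size ** 2
    weights = layer.out_ch * layer.in_ch * layer.kernel ** 2
    if pattern == "input":      # inputs stay on chip; weights stream repeatedly
        return ifmap + weights * 4 + ofmap
    if pattern == "output":     # partial sums stay on chip; inputs/weights stream
        return ifmap * 2 + weights * 2 + ofmap
    if pattern == "weight":     # weights stay on chip; feature maps stream
        return ifmap * 3 + weights + ofmap
    raise ValueError(f"unknown pattern: {pattern}")


def schedule(layers):
    """Pick, per layer, the reuse pattern with the lowest estimated traffic."""
    plan = {}
    for layer in layers:
        costs = {p: dram_traffic(layer, p) for p in ("input", "output", "weight")}
        plan[layer.name] = min(costs, key=costs.get)
    return plan


if __name__ == "__main__":
    # Hypothetical layers loosely shaped like early/late AlexNet-style layers.
    net = [
        ConvLayer("conv1", in_ch=3,   out_ch=96,  in_size=227, kernel=11, stride=4),
        ConvLayer("conv3", in_ch=256, out_ch=384, in_size=13,  kernel=3),
        ConvLayer("conv5", in_ch=384, out_ch=256, in_size=13,  kernel=3),
    ]
    for name, pattern in schedule(net).items():
        print(f"{name}: use {pattern} reuse")
```

Running the sketch prints one chosen reuse pattern per layer; the point is only that early layers (large feature maps, few weights) and late layers (small feature maps, many weights) favor different patterns, which is the motivation for reconfigurable, per-layer computation patterns in the abstract.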