A Coarse-Grained Dual-Convolver Based CNN Accelerator with High Computing Resource Utilization

Yi Lu, Yi Lin Wu, Juinn Dar Huang

研究成果: Conference contribution同行評審

摘要

Deep learning technologies have been developed rapidly in recent years and have played an important role in our lives. Among them, convolutional neural network (CNN) performs well in many applications. The quality of result is generally getting better as the number of convolutional layers increases, which also increases the computational complexity. Hence, a highly resource-efficient accelerator is demanded. In this paper, we propose a new CNN accelerator that features a delay-chain-free input data aligner as well as a dual-convolver processing element (DCPE). Our architecture does not require delay chains with a large number of registers for input data alignment, which not only reduces the area and power but improves the overall resource utilization. In addition, a set of DCPEs shares the same input aligner to produce multiple output feature maps concurrently, which offers the desirable computing power and reduces the external memory traffic. An accelerator instance with 8 DCPEs (144 MACs) has been implemented using TSMC 40nm process. The internal logic only consumes 285K gates and the total internal memory size is merely 44KB. As running VGG-16, the average performance is 190GOPS (@750MHz), the resource (MAC) utilization reaches 8S.3%, and the energy efficiency is 481GOPS/W.

原文English
主出版物標題Proceedings - 2020 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2020
發行者Institute of Electrical and Electronics Engineers Inc.
頁面198-202
頁數5
ISBN(電子)9781728149226
DOIs
出版狀態Published - 八月 2020
事件2020 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2020 - Genova, Italy
持續時間: 31 八月 20202 九月 2020

出版系列

名字Proceedings - 2020 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2020

Conference

Conference2020 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2020
國家Italy
城市Genova
期間31/08/202/09/20

指紋 深入研究「A Coarse-Grained Dual-Convolver Based CNN Accelerator with High Computing Resource Utilization」主題。共同形成了獨特的指紋。

引用此