Design Space Exploration of Neural Network Activation Function Circuits

作者:Yang, Tao; Wei, Yadong; Tu, Zhijun; Zeng, Haolun; Kinsy, Michel A.; Zheng, Nanning; Ren, Pengju*
来源:IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2019, 38(10): 1974-1978.
DOI:10.1109/TCAD.2018.2871198

摘要

The widespread application of artificial neural networks has prompted researchers to experiment with field-programmable gate array and customized ASIC designs to speed up their computation. These implementation efforts have generally focused on weight multiplication and signal summation operations, and less on activation functions used in these applications. Yet, efficient hardware implementations of nonlinear activation functions like exponential linear units (ELU), scaled ELU (SELU), and hyperbolic tangent (tanh), are central to designing effective neural network accelerators, since these functions require lots of resources. In this paper, we explore efficient hardware implementations of activation functions using purely combinational circuits, with a focus on two widely used nonlinear activation functions, i.e., SELU and tanh. Our experiments demonstrate that neural networks are generally insensitive to the precision of the activation function. The results also prove that the proposed combinational circuit-based approach is very efficient in terms of speed and area, with negligible accuracy loss on the MNIST, CIFAR-10, and IMAGENET benchmarks. Synopsys design compiler synthesis results show that circuit designs for tanh and SELU can save between x3.13 similar to x7.69 and x4.45 similar to x8.45 area compared to the look-up table/memory-based implementations, and can operate at 5.14 GHz and 4.52 GHz using the 28-nm SVT library, respectively. The implementation is available at: https://github.com/ThomasMrY/ActivationFunctionDemo.