A novel softplus linear unit for deep convolutional neural networks

Zhao, Huizhen<sup>*</sup>; Liu, Fuxian; Li, Longyue; Luo, Chang

doi:10.1007/s10489-017-1028-7

摘要

Current improvements in the performance of deep neural networks are partly due to the proposition of rectified linear units. A ReLU activation function outputs zero for negative component, inducing the death of some neurons and a bias shift of the outputs, which causes oscillations and impedes learning. According to the theory that "zero mean activations improve learning ability", a softplus linear unit (SLU) is proposed as an adaptive activation function that can speed up learning and improve performance in deep convolutional neural networks. Firstly, for the reduction of the bias shift, negative inputs are processed using the softplus function, and a general form of the SLU function is proposed. Secondly, the parameters of the positive component are fixed to control vanishing gradients. Thirdly, the rules for updating the parameters of the negative component are established to meet back- propagation requirements. Finally, we designed deep auto-encoder networks and conducted several experiments with them on the MNIST dataset for unsupervised learning. For supervised learning, we designed deep convolutional neural networks and conducted several experiments with them on the CIFAR-10 dataset. The experiments have shown faster convergence and better performance for image classification of SLU-based networks compared with rectified activation functions.

出版日期2018-7
单位中国人民解放军空军工程大学

全文

访问全文

收藏分享被引(8) 浏览

更新时间：2021-12-26 19:22

A novel softplus linear unit for deep convolutional neural networks

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友