The matrix form of the Backpropagation algorithm

In a multi-layered neural network, the weights and neural connections can be treated as matrices: the neurons of one layer form the columns of the matrix, and the neurons of the other layer form its rows. The figure below shows a network and its parameter matrices.

Figure (nn_matrices): Matrices used in modelling a multi-layered neural network

The meanings of the vectors and matrices above:
n^in_l: the input vector of the l-th layer.
n^out_l: the output vector of the l-th layer (l = 1…L). The input vector of the neural network is n^out_0, and its output vector is n^out_L.

b_l: the bias (threshold) vector of the l-th layer.
W_l: the weight parameter matrix between layers l and (l-1).

n^out_l = f(n^in_l): the activation function of the neurons.

For example, the weight matrix of the 3rd layer can be expressed as follows:
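
A sketch of such a matrix, assuming layer 3 has N_3 neurons and layer 2 has N_2 neurons. The symbols N_2, N_3 and w^{(3)}_{i,j} are notation introduced here for illustration. The orientation (one row per neuron of layer 3, one column per neuron of layer 2) is also an assumption, chosen so that W_3 can multiply the output vector of layer 2:

```latex
W_3 =
\begin{pmatrix}
w^{(3)}_{1,1}   & w^{(3)}_{1,2}   & \cdots & w^{(3)}_{1,N_2}   \\
w^{(3)}_{2,1}   & w^{(3)}_{2,2}   & \cdots & w^{(3)}_{2,N_2}   \\
\vdots          & \vdots          & \ddots & \vdots            \\
w^{(3)}_{N_3,1} & w^{(3)}_{N_3,2} & \cdots & w^{(3)}_{N_3,N_2}
\end{pmatrix}
```

Here w^{(3)}_{i,j} would be the weight of the connection from neuron j of layer 2 to neuron i of layer 3.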

Using matrices for forward propagation:
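
A plausible form of these equations, derived from the definitions above. It assumes that f is applied element-wise, that W_l has one row per neuron of layer l, and that the bias is added (a threshold convention would subtract it instead):

```latex
n^{in}_l  = W_l \, n^{out}_{l-1} + b_l,
\qquad
n^{out}_l = f\!\left(n^{in}_l\right),
\qquad l = 1, \dots, L
```

Starting from the network input n^out_0, applying these two steps layer by layer yields the network output n^out_L.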

The backpropagation algorithm:
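
One common way to write the error recursion, offered as a sketch under assumptions not confirmed by the source: a squared-error loss E = ½‖n^out_L − t‖², the symbol ⊙ for the element-wise product, f′ for the derivative of the activation function, and a layer subscript on e. The sign convention (output minus target) is likewise an assumption:

```latex
e_L = \left(n^{out}_L - t\right) \odot f'\!\left(n^{in}_L\right)

e_l = \left(W_{l+1}^{T} \, e_{l+1}\right) \odot f'\!\left(n^{in}_l\right),
\qquad l = L-1, \dots, 1
```

The error is first computed on the output layer and then propagated backwards through the transposed weight matrices.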

The vector e denotes the error of the current layer, and t is the target vector of the current pattern. Once the errors of all layers have been determined, the gradients can be computed in a single forward-propagation step:
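
A sketch of the gradient formulas, assuming e_l is the derivative of a squared-error loss E with respect to n^in_l and that the loss is averaged over the training patterns. The symbol E, the pattern index (p) and the 1/P averaging are assumptions introduced here, and some formulations use a plain sum instead:

```latex
\frac{\partial E}{\partial W_l}
  = \frac{1}{P} \sum_{p=1}^{P} e^{(p)}_l \left(n^{out,(p)}_{l-1}\right)^{T},
\qquad
\frac{\partial E}{\partial b_l}
  = \frac{1}{P} \sum_{p=1}^{P} e^{(p)}_l
```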

where P is the number of training patterns. The algorithm above must be executed for every pattern.
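
The per-pattern procedure can be sketched in plain Python, with lists of lists standing in for matrices so that no libraries are needed. This is a minimal sketch under assumptions the source does not state: a sigmoid activation for f, a squared-error loss, a bias that is added, and gradients averaged over the P patterns. The function names `forward`, `backward`, `gradients` and `loss` are hypothetical.

```python
import math


def sigmoid(x):
    # Assumed activation function f (the source does not name one).
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    """Multiply matrix W (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def forward(weights, biases, x):
    """Return n_out, where n_out[0] is the input and n_out[l] the output of layer l."""
    n_out = [list(x)]
    for W, b in zip(weights, biases):
        n_in = [s + bi for s, bi in zip(matvec(W, n_out[-1]), b)]
        n_out.append([sigmoid(v) for v in n_in])
    return n_out


def backward(weights, n_out, t):
    """Return the error vectors e[1..L] (e[0] is unused) for a squared-error loss."""
    L = len(weights)
    e = [None] * (L + 1)
    # For the sigmoid, f'(n_in) can be written through the output: y * (1 - y).
    e[L] = [(y - ti) * y * (1.0 - y) for y, ti in zip(n_out[L], t)]
    for l in range(L - 1, 0, -1):
        W_next = weights[l]  # 0-based list: weights[l] holds W_{l+1}
        back = [sum(W_next[i][j] * e[l + 1][i] for i in range(len(W_next)))
                for j in range(len(n_out[l]))]  # W_{l+1}^T e_{l+1}
        e[l] = [bk * y * (1.0 - y) for bk, y in zip(back, n_out[l])]
    return e


def gradients(weights, biases, patterns):
    """Average dE/dW_l and dE/db_l over all (input, target) patterns."""
    P = len(patterns)
    gW = [[[0.0] * len(W[0]) for _ in W] for W in weights]
    gb = [[0.0] * len(b) for b in biases]
    for x, t in patterns:
        n_out = forward(weights, biases, x)
        e = backward(weights, n_out, t)
        for l in range(1, len(weights) + 1):
            for i, ei in enumerate(e[l]):
                gb[l - 1][i] += ei / P
                for j, yj in enumerate(n_out[l - 1]):
                    gW[l - 1][i][j] += ei * yj / P  # outer product e_l n_out_{l-1}^T
    return gW, gb


def loss(weights, biases, patterns):
    """Mean squared-error loss over all patterns (assumed loss function)."""
    total = 0.0
    for x, t in patterns:
        y = forward(weights, biases, x)[-1]
        total += 0.5 * sum((yi - ti) ** 2 for yi, ti in zip(y, t))
    return total / len(patterns)
```

A simple way to gain confidence in such a sketch is to compare individual entries of the returned gradients with a central finite difference of `loss`. The two should agree to several decimal places.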

DeepTrainer

Deep Learning algorithm R&D for Artificial Neural Networks
