How to calculate the derivative of a matrix multiplication such as WHW
Calculating derivatives are key to training AI neural networks, being part of a backpropagation step. The mathematical details of how to calculate the derivative of a matrix multiplication is not very clear. For example, if we have a \(MX1\) weight vector \(W\) and a \(MXM\) matrix \(H\), we can calculate the derivative of \(W^THW\) as follows:
Enjoy Reading This Article?
Here are some more articles you might like to read next: