Gradient Formula used in NMF

asutosh97 · October 2, 2018, 1:34pm

Can anyone tell where the formula for gradient came from which is used while writing NMF from scratch?

roy_arunabha · January 16, 2019, 1:38pm

All the NMF demonstrations in this lecture use a common and consistent energy functional given by : E = (1/2) * Trace [ (R.T @ R) + lamda * (mu - H).T @ (mu - H) + lamda * ( mu - W ).T @ (mu - W) ] where lamda = 0, wherever H, W > mu.

The first term is the squared Frobenius norm of the residual matrix, R = M - WH. The second and third terms are quadratic regularizers which in the case of ill-posed problems, depending upon the size of lamda, restrict the solutions from taking on undesirable values. (In this case, that condition is negative values of W and H elements)

Taking partial derivatives of this energy functional w.r.t. H, W lead to the gradient expressions computed explicitly and used in the python notebook.

beniamin · February 20, 2019, 11:52pm

Why do we use ‘energy functional’ instead of derivatives of Frobenius norm?

Marietta · January 26, 2021, 7:10am

Thanks for the information. Keep suggesting such post.

Marietta · January 27, 2021, 9:09am

Kroger ESS Schedule