I’m still confused by notation. In the paper it says each fi function within f returns a scalar. So:
f = [f1(x), f2(x), … fm(x)], where x is [x1, x2, …xn]
So here it suggests that f1 is a function of all of the parameters in x… i.e. [x1, x2, …xn]
However, in @beecoder 's post it says that “the ith scalar function in f(x) is a function of (only) the ith term in vector x”.
This suggests that f1 is a function of only the first parameter (x1) in vector x.
What am I missing?
EDIT: I think I figured it out. For element-wise operations, it only makes sense if f1 only applies to x1, f2 to x2, and so on.
Usually, f = [f1(x), f2(x), … fm(x)], where x is [x1, x2, …xn]. Here, f1 is a function of [x1, x2, …xn].
However, if we want to do element-wise operations of f with g = [g1(w), g2(w), … gm(w)], we need functions that are only sensitive to a single element of their respective vectors. For element-wise, it doesn’t make any sense to consider f1([x1, x2, … xn]), since that would not be element-wise! Instead for element-wise, f1 is only defined for x1, and so on.
Therefore, the equation y1 = f1(x) O g1(w) is the same as y1 = f1(x1) O g1(w1).