Question about the value that decides if an image is a 3 or a 7

ForBo7 · August 23, 2023, 9:23am

Forgot to click “Reply” on the response I meant to send yesterday (－‸ლ)

That forum post definitely is helpful, and now I think I understand why 0 is chosen.

Firstly, it’s because the way that the weights that have been chosen means the model ends up producing predictions that are either 0 or 1.

Secondly, it’s because of the sigmoid loss function we use later on. This function maps any input number to a value between 0 and 1, with negative numbers resulting in an output closer to 0, positive numbers resulting in an output closer to 1, and a 0 resulting in an output of 0.5 — the middle point of the sigmoid. Later on in the notebook, we use the sigmoid function in such a way that any output larger than 0.5 is one class, and any output less than 0.5 is another class.

That post definitely helped clear this confuzzlement.