Question about the value that decides if an image is a 3 or a 7

Forgot to click “Reply” on the response I meant to send yesterday (-‸ლ)

That forum post definitely is helpful, and now I think I understand why 0 is chosen.

  • Firstly, it’s because the way that the weights that have been chosen means the model ends up producing predictions that are either 0 or 1.
  • Secondly, it’s because of the sigmoid loss function we use later on. This function maps any input number to a value between 0 and 1, with negative numbers resulting in an output closer to 0, positive numbers resulting in an output closer to 1, and a 0 resulting in an output of 0.5 — the middle point of the sigmoid. Later on in the notebook, we use the sigmoid function in such a way that any output larger than 0.5 is one class, and any output less than 0.5 is another class.

That post definitely helped clear this confuzzlement. :smile:

5 Likes