First, remember that DL Part 2 (2018) uses the old version of the fastai library. Things have changed in the new one.
Note that the first two values are not zero, just very small numbers: 2e-8 and 2e-7. The reason is that we don’t want to modify the very first layers of the network too much.
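To make the effect concrete, here is a rough sketch in plain Python. It is not fastai code. It assumes the values in question are per-layer-group learning rates; the last two rates, the toy weights, and the `sgd_step` helper are made up for illustration. It just shows that with one plain SGD step the early groups barely move while the later ones change much more.

```python
# First two rates are the ones from the post; the last two are hypothetical.
lrs = [2e-8, 2e-7, 1e-5, 1e-3]

def sgd_step(weights, grads, lrs):
    """One plain SGD step, using a separate learning rate per layer group."""
    return [
        [w - lr * g for w, g in zip(ws, gs)]
        for ws, gs, lr in zip(weights, grads, lrs)
    ]

# Toy setup: every group starts with the same weights and sees the same gradient.
weights = [[1.0, 1.0] for _ in lrs]
grads = [[0.5, 0.5] for _ in lrs]

new_weights = sgd_step(weights, grads, lrs)

# How far the first weight of each group moved after one step.
changes = [abs(new[0] - old[0]) for new, old in zip(new_weights, weights)]
for i, change in enumerate(changes):
    print(f"group {i}: lr={lrs[i]:g}, change={change:.3g}")
```

Because the rates are tiny but not zero, the first groups still get updated, only very slowly compared to the last ones.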
As for the other question, take a look at the comments in the source here.