Sorry for using this thread as a lab notebook… but today I was tweaking the “hyperparameter” (the 0.7 in the uniform_(-0.7,0.7) call that initializes the parameters in conv_twist), and the 5-epoch accuracy kept going up as I increased it. The way I think of it, (center_x, center_y) is the point around which conv_twist “twists” the image, and at this scale the image occupies the square [-1,1]x[-1,1].
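To make the geometry concrete, here is a rough sketch (plain Python, not the actual conv_twist code) that estimates how often a randomly initialized center lands inside the image square for a given bound. It assumes center_x and center_y are each drawn independently from uniform(-a, a); the function name `fraction_inside` is made up for this illustration.

```python
import random

def fraction_inside(a, n=100_000, seed=0):
    """Estimate the share of centers that fall inside [-1,1]x[-1,1].

    Assumption: center_x and center_y are sampled independently
    from uniform(-a, a), mirroring the uniform_(-a, a) init.
    """
    rng = random.Random(seed)
    inside = 0
    for _ in range(n):
        cx = rng.uniform(-a, a)
        cy = rng.uniform(-a, a)
        if abs(cx) <= 1.0 and abs(cy) <= 1.0:
            inside += 1
    return inside / n

for a in (0.7, 1.5, 2.0, 2.5):
    print(a, fraction_inside(a))
```

For a bound of at most 1 every center is inside the image; for larger bounds the expected share is (1/a)^2 under that assumption, so at 2.0 only about a quarter of the centers start inside.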
Turning it up to 1.5 (so the center can be outside the image):
[0.741156 0.73632 0.744973 0.751591 0.72563 ]
mean 0.7399338, std 0.008722797
At 2.0:
[0.757954 0.747264 0.748282 0.750064 0.75719 ]
mean 0.75215065, std 0.004522691
At 2.5:
[0.759735 0.755663 0.75999 0.753372 0.761262]
mean 0.75800455, std 0.0029829554
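The pair of numbers under each list of five accuracies is consistent with the mean and the population standard deviation of those runs. A small check using only the standard library (the `runs` dict and `summarize` helper are names made up here, not part of the original notebook):

```python
from statistics import fmean, pstdev

# 5-epoch accuracies copied from the post, keyed by the uniform_ bound.
runs = {
    1.5: [0.741156, 0.73632, 0.744973, 0.751591, 0.72563],
    2.0: [0.757954, 0.747264, 0.748282, 0.750064, 0.75719],
    2.5: [0.759735, 0.755663, 0.75999, 0.753372, 0.761262],
}

def summarize(accs):
    """Return (mean, population standard deviation) of a list of accuracies."""
    return fmean(accs), pstdev(accs)

for bound, accs in runs.items():
    mean, std = summarize(accs)
    print(f"bound={bound}: mean={mean:.6f} std={std:.6f}")
```

With only five runs per setting, the gap between 2.0 and 2.5 is about the size of the spread within each setting, so that last step up is suggestive rather than conclusive.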
(I’m not sure we can still call this ResNet, as it is using 3x3 exclusively.)
I will start a new thread [Update: new thread here]. This reminds me of the PolyMath projects (of Terry Tao and others), a series of online collaborative projects in research mathematics in recent years. Participants would post small comments on a blog post describing the problem, and once in a while (when the comments reached the hundreds) the host would start a new post summarizing what had been learned, and discussion would take off from there. There have been a couple of success stories. I’m new to fastai, but that is probably what’s happening here too.