How similar is Amp vs. fastai’s methodology for enabling mixed-precision training? For example, do they differ in designation of which pieces are/aren’t executed in FP16?
Should Amp provide additional speed/accuracy improvements in either all or specific scenarios, or is it more focused on ease of use?
Very interested in hearing any other points of comparison or opinions on this topic!
They’re very similar. We’re working with Nvidia to try to move more of our backend code into Apex. I think the fastai API is more flexible and usable, which is why we’re trying to get the best of both.