AdamW & ImageNet

I’m looking to apply AdamW to training ResNet-18 on ImageNet from scratch. I’ve been looking around for existing implementations, but so far I can’t find any that deal specifically with training on ImageNet from scratch using AdamW. What I’ve found so far:

- The recent “ImageNet in 18 minutes” work used SGD. I imagine this was done for speed and simplicity.

- The original AdamW paper looked at a downsampled version of ImageNet.

- sgugger’s original work and the DAWNBench work only use AdamW for CIFAR, or for fine-tuning an ImageNet-pretrained model on the Stanford Cars dataset.

Have there been any successful applications of AdamW to full ImageNet training, or am I barking up the wrong tree here?
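
For concreteness, here is a minimal sketch of the update I mean by AdamW: Adam with the weight decay applied directly to the weights instead of being added to the gradient. It is plain Python on lists of floats, with no framework involved. The function name `adamw_step` is just for illustration, and the default hyperparameters are placeholder assumptions, not values tuned for ImageNet.

```python
import math

def adamw_step(params, grads, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999,
               eps=1e-8, weight_decay=1e-2):
    """One AdamW update over plain lists of floats.

    t is the 1-based step count. Returns (new_params, new_m, new_v).
    Hyperparameter defaults are illustrative assumptions only.
    """
    new_params, new_m, new_v = [], [], []
    for p, g, m_i, v_i in zip(params, grads, m, v):
        # Exponential moving averages of the gradient and squared gradient.
        m_i = beta1 * m_i + (1 - beta1) * g
        v_i = beta2 * v_i + (1 - beta2) * g * g
        # Bias correction for the zero-initialised averages.
        m_hat = m_i / (1 - beta1 ** t)
        v_hat = v_i / (1 - beta2 ** t)
        # Decoupled weight decay: shrink the weight directly,
        # rather than adding weight_decay * p to the gradient.
        p = p * (1 - lr * weight_decay)
        # Standard Adam step on the decayed weight.
        p = p - lr * m_hat / (math.sqrt(v_hat) + eps)
        new_params.append(p)
        new_m.append(m_i)
        new_v.append(v_i)
    return new_params, new_m, new_v
```

With a zero gradient the weight still shrinks by a factor of `1 - lr * weight_decay`, which is the behaviour that separates this from Adam with L2 regularisation folded into the gradient.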