Part 2 for A disciplined approach to neural network hyper-parameters: Part 1

bhavikngala · February 22, 2019, 11:33pm

Hi,
Is there a Part 2 to the paper "A disciplined approach to neural network hyper-parameters: Part 1 – learning rate, batch size, momentum, and weight decay"? The paper indicated there may be a Part 2, but I can't find it on Google.
Thanks and regards.

hotessy · April 16, 2019, 11:19am

There isn’t any as of April 2019.

Th3Lourde · June 23, 2020, 11:17pm

I emailed the author and got this reply:

Dear Eli,

Thank you for your interest in my research.

When I wrote that report I was planning a second part on architectures, but a number of papers appeared that seemed to cover the topic, so there isn’t a part 2.

Two papers I would recommend are:

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks at https://arxiv.org/abs/1905.11946

Designing Network Design Spaces at https://arxiv.org/abs/2003.13678