precompute=True

Cool.

So it looks like it is grabbing everything except the average pooling and the final FC layer:

from torchvision.models import resnet34  # re-exported by fastai's wildcard imports in the notebook

r = resnet34()
list(r.children())[:8]

output:

[Conv2d(3, 64, kernel_size=(7, 7), stride=(2, 2), padding=(3, 3), bias=False),
 BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True),
 ReLU (inplace),
 MaxPool2d (size=(3, 3), stride=(2, 2), padding=(1, 1), dilation=(1, 1)),
 Sequential (
   (0): BasicBlock (
     (conv1): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True)
   )
   (1): BasicBlock (
     (conv1): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True)
   )
   (2): BasicBlock (
     (conv1): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True)
   )
 ),
 Sequential (
   (0): BasicBlock (
     (conv1): Conv2d(64, 128, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True)
     (downsample): Sequential (
       (0): Conv2d(64, 128, kernel_size=(1, 1), stride=(2, 2), bias=False)
       (1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True)
     )
   )
   (1): BasicBlock (
     (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True)
   )
   (2): BasicBlock (
     (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True)
   )
   (3): BasicBlock (
     (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True)
   )
 ),
 Sequential (
   (0): BasicBlock (
     (conv1): Conv2d(128, 256, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
     (downsample): Sequential (
       (0): Conv2d(128, 256, kernel_size=(1, 1), stride=(2, 2), bias=False)
       (1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
     )
   )
   (1): BasicBlock (
     (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
   )
   (2): BasicBlock (
     (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
   )
   (3): BasicBlock (
     (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
   )
   (4): BasicBlock (
     (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
   )
   (5): BasicBlock (
     (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True)
   )
 ),
 Sequential (
   (0): BasicBlock (
     (conv1): Conv2d(256, 512, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True)
     (downsample): Sequential (
       (0): Conv2d(256, 512, kernel_size=(1, 1), stride=(2, 2), bias=False)
       (1): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True)
     )
   )
   (1): BasicBlock (
     (conv1): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True)
   )
   (2): BasicBlock (
     (conv1): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn1): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True)
     (relu): ReLU (inplace)
     (conv2): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
     (bn2): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True)
   )
 )]

It then looks like it is appending an AdaptiveConcatPool2d() layer plus a new FC layer (output size = number of classes).

Exactly. AdaptiveConcatPool2d and Flatten:

layers += [AdaptiveConcatPool2d(), Flatten()]

We can control the depth of the cut with the xtra_cut parameter:

cut -= xtra_cut
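
For reference, a rough sketch of how that cut ends up being applied (the value 8 and the variable names below are assumptions based on fastai 0.7 conventions, not the library source):

from torchvision.models import resnet34

# Illustrative values mirroring fastai 0.7's model_meta: for resnet34 the cut is 8,
# i.e. keep the first 8 children (everything before AvgPool2d and Linear).
cut = 8
xtra_cut = 0                 # drop this many extra layers from the end of the backbone
cut -= xtra_cut

backbone_layers = list(resnet34().children())[:cut]
print(len(backbone_layers))  # 8 when xtra_cut=0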

Lost here. What is it precomputing?

I understand using pre-trained weights and freezing layers while training, but I'm not able to understand precomputing activations.


You just need to take your thought to the next step: since you are using pre-trained weights and freezing those layers, the output from that set of layers won't change in each epoch as you go through the inputs. So you can precompute the output from those layers once. This makes training the later layers, whose weights are changing, MUCH faster.
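
Here is a minimal plain-PyTorch sketch of that idea (not the fastai implementation; the dummy data, the simple avg-pool head, and the two-class setup are assumptions for illustration): run the frozen backbone over the dataset once, cache the outputs, then train only the head against those cached features.

import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset
from torchvision.models import resnet34

# Frozen backbone: the first 8 children of resnet34 (everything before pooling/FC).
backbone = nn.Sequential(*list(resnet34().children())[:8]).eval()
for p in backbone.parameters():
    p.requires_grad = False

# Small trainable head (plain avg-pool here; fastai uses AdaptiveConcatPool2d).
head = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(512, 2))

# Dummy data standing in for a real, *unshuffled* DataLoader.
images = torch.randn(8, 3, 224, 224)
labels = torch.randint(0, 2, (8,))
loader = DataLoader(TensorDataset(images, labels), batch_size=4, shuffle=False)

# 1) Precompute: one pass through the frozen layers, done exactly once.
with torch.no_grad():
    cached = [(backbone(xb), yb) for xb, yb in loader]

# 2) Train: each epoch only runs the cheap head over the cached activations.
opt = torch.optim.SGD(head.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()
for epoch in range(3):
    for feats, yb in cached:
        loss = loss_fn(head(feats), yb)
        opt.zero_grad()
        loss.backward()
        opt.step()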


Aah, got it! Thanks! :)

In the next few lectures Jeremy will explain the concept of differential learning rates. The basic idea is that you use different learning rates for different layers: layers closer to the input get lower learning rates, and layers closer to the output get higher ones. model_meta is used to define three groups of layers for the purpose of applying differential learning rates.
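
Underneath, differential learning rates are just per-parameter-group learning rates in the optimizer. A hedged plain-PyTorch sketch (the three-way split and the specific rates are illustrative assumptions, not fastai's actual grouping):

import torch
import torch.nn as nn
from torchvision.models import resnet34

model = resnet34()
children = list(model.children())
# Assumed split into three layer groups: early conv layers, later conv layers, and the head.
early, middle, head = children[:6], children[6:8], children[8:]

def params(modules):
    return [p for m in modules for p in m.parameters()]

opt = torch.optim.SGD(
    [
        {"params": params(early),  "lr": 1e-4},  # closest to the input: smallest lr
        {"params": params(middle), "lr": 1e-3},
        {"params": params(head),   "lr": 1e-2},  # closest to the output: largest lr
    ],
    lr=1e-2,       # default, overridden per group above
    momentum=0.9,
)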


Hello @jeremy

I see that we are chopping off the last two layers of resnet34:

<class 'torch.nn.modules.conv.Conv2d'>
<class 'torch.nn.modules.batchnorm.BatchNorm2d'>
<class 'torch.nn.modules.activation.ReLU'>
<class 'torch.nn.modules.pooling.MaxPool2d'>
<class 'torch.nn.modules.container.Sequential'>
<class 'torch.nn.modules.container.Sequential'>
<class 'torch.nn.modules.container.Sequential'>
<class 'torch.nn.modules.container.Sequential'>
                --- cut off ---
<class 'torch.nn.modules.pooling.AvgPool2d'>
<class 'torch.nn.modules.linear.Linear'>

And then we are adding a bunch of other layers (almost double).

<class 'torch.nn.modules.conv.Conv2d'>
<class 'torch.nn.modules.batchnorm.BatchNorm2d'>
<class 'torch.nn.modules.activation.ReLU'>
<class 'torch.nn.modules.pooling.MaxPool2d'>
<class 'torch.nn.modules.container.Sequential'>
<class 'torch.nn.modules.container.Sequential'>
<class 'torch.nn.modules.container.Sequential'>
<class 'torch.nn.modules.container.Sequential'>
                --- add ---
<class 'fastai.layers.AdaptiveConcatPool2d'>
<class 'fastai.layers.Flatten'>
<class 'torch.nn.modules.batchnorm.BatchNorm1d'>
<class 'torch.nn.modules.dropout.Dropout'>
<class 'torch.nn.modules.linear.Linear'>
<class 'torch.nn.modules.activation.ReLU'>
<class 'torch.nn.modules.batchnorm.BatchNorm1d'>
<class 'torch.nn.modules.dropout.Dropout'>
<class 'torch.nn.modules.linear.Linear'>
<class 'torch.nn.modules.activation.LogSoftmax'>

I wanted to get some insight into why these are laid out the way they are; I believe they are important from a transfer learning point of view. Is this something you will take up this coming Monday? (In the meantime I am digging further.)


We’ll be gradually covering it. But basically we’re adding two fully connected blocks, each containing batchnorm, dropout, and a ReLU activation.
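
A sketch of what that head looks like when rebuilt in plain PyTorch (AdaptiveConcatPool2d concatenates adaptive max and average pooling; the layer sizes, dropout rates, and num_classes below are illustrative assumptions rather than the library defaults):

import torch
import torch.nn as nn

class AdaptiveConcatPool2d(nn.Module):
    """Concatenate adaptive max pooling and adaptive average pooling."""
    def __init__(self, output_size=1):
        super().__init__()
        self.ap = nn.AdaptiveAvgPool2d(output_size)
        self.mp = nn.AdaptiveMaxPool2d(output_size)

    def forward(self, x):
        return torch.cat([self.mp(x), self.ap(x)], dim=1)

num_classes = 2  # assumption: e.g. dogs vs cats

head = nn.Sequential(
    AdaptiveConcatPool2d(), nn.Flatten(),        # 512 channels -> 1024 features (max + avg)
    nn.BatchNorm1d(1024), nn.Dropout(0.25), nn.Linear(1024, 512), nn.ReLU(),
    nn.BatchNorm1d(512),  nn.Dropout(0.5),  nn.Linear(512, num_classes),
    nn.LogSoftmax(dim=1),
)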


I’m still a bit confused about this precompute argument, especially this statement in the lesson 1 notebook:

Review: easy steps to train a world-class image classifier

Train last layer with data augmentation (i.e. precompute=False) for 2-3 epochs with cycle_len=1

This sentence makes me think setting precompute=False and data augmentation go hand in hand - is that the case? If so, in the tutorial code when we do data augmentation like below (with precompute=True), are we actually augmenting anything?

tfms = tfms_from_model(resnet34, sz, aug_tfms=transforms_side_on, max_zoom=1.1)
data = ImageClassifierData.from_paths(PATH, tfms=tfms)
learn = ConvLearner.pretrained(arch, data, precompute=True)

By default when we create a learner, it sets all but the last layer to frozen. That means that it’s still updating the weights in the last layer when we call fit. When we set precompute = False, it unfreezes all layers.


Then what’s the difference between precompute=False and learn.unfreeze()?


precompute=True/False indicates whether we want to use precomputed activations.
learn.freeze()/learn.unfreeze() indicates whether the parameters in the network are trainable during training.
Irrespective of whether precompute is True or False, the weights in the network will change during training if the layers are unfrozen. The precompute argument tells the network what to do before training, and the freeze/unfreeze methods tell it what to do during training.
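
Putting the two together, the lesson 1 workflow looks roughly like this, continuing from the snippet quoted above (hedged: this follows my recollection of the fastai 0.7 API, so exact argument names may differ):

import numpy as np

learn = ConvLearner.pretrained(arch, data, precompute=True)
learn.fit(1e-2, 2)                  # head trained against activations cached before training

learn.precompute = False            # activations computed on the fly, so augmentation now takes effect
learn.fit(1e-2, 3, cycle_len=1)     # layers still frozen: only the head is trainable

learn.unfreeze()                    # from here on, every layer group is trainable
lrs = np.array([1e-4, 1e-3, 1e-2])  # differential learning rates per layer group
learn.fit(lrs, 3, cycle_len=1)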


Thanks, I think I understand it now. Just to confirm,

  • If precompute=True, it doesn’t matter whether we call learn.freeze() or learn.unfreeze(), does it? Since all the activations before the last layer are pre-computed? (So it doesn’t really make sense to call learn.unfreeze() with precompute=True, does it?)
  • If learn.freeze(), setting precompute=False likely leads to similar (if not the same) results as precompute=True, since the weights stay the same, is that right?
  1. precompute=True precomputes the activations for all but the last layers. But those activations can change during training depending on whether the layers are frozen or unfrozen. In the case of precompute=True with learn.unfreeze(), the initial weights in the network come from the precomputed activations. During training, those weights change in all the layers, as you have now kept the weights in every layer as trainable parameters.
  2. precompute=False with learn.freeze() means only the last layer can learn now, and all the other layers have a random initialization of weights which are not optimised for any dataset. Hence, they are far from a local minimum most of the time, and freezing every layer except the last severely limits the learning ability of the network.

I don’t think I follow @ar_ai’s explanation of the interactions between precompute, data augmentation, and freeze/unfreeze. The concepts are still a bit confusing to me.

My understanding and intuition were also similar to @runze’s original understanding that precompute=False and data augmentation go hand in hand. During data augmentation we are slightly changing the images in the mini-batches we generate, so we cannot lock in the pre-computed activations. Hence, while doing data augmentation we have to set precompute=False. I’m not really sure what happens if you set precompute=True and do data augmentation.

Moreover, I think:

  • pre-compute refers to the activations before the last layer.
  • freeze/un-freeze refers to the weights of the layers before the last layer.

Hopefully, Jeremy is going to go over these details in the class tomorrow.

This (the earlier claim that setting precompute=False unfreezes all the layers) is incorrect. Only unfreeze or freeze_to do that.

If we’re precomputing, we can’t use data augmentation (since the precomputed activations are for some specific input, whereas augmentation changes it every time).


OK… looks like I got it totally wrong. Let me try again.
precompute=True/False: whether we feed precomputed activations into the network for everything except the last layer.
freeze/unfreeze: whether the weights change or not during training. Is this correct?

Yes, that’s reasonably correct, although some details are missing… e.g. freeze/unfreeze refer to all but the last layer, but there’s also freeze_to(idx), which freezes layers up to (but not including) layer number idx.
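
Under the hood, freezing just means turning off gradients for the earlier layer groups. A hedged sketch of what freeze_to(idx) amounts to (the grouping and helper below are assumptions, not the fastai source):

import torch.nn as nn
from torchvision.models import resnet34

children = list(resnet34().children())
# Assumed grouping: two backbone groups plus a head group.
layer_groups = [nn.Sequential(*children[:6]),
                nn.Sequential(*children[6:8]),
                nn.Sequential(*children[8:])]

def freeze_to(idx):
    """Freeze layer groups [0, idx); groups from idx onwards stay trainable."""
    for i, group in enumerate(layer_groups):
        trainable = i >= idx
        for p in group.parameters():
            p.requires_grad = trainable

freeze_to(2)   # only the last group learns (roughly what the default freeze does)
freeze_to(0)   # unfreeze everything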


Yes, I noticed that.

Hi @jeremy, you will be covering these things in detail in the upcoming sessions, right? I am not able to follow the discussion in this thread because I have no prior knowledge.