@brendan, just a quick clarification that might be helpful during the implementation:
Last night I implemented the Tiramsu and it’s working as it is, but I need to clean it and replicate the results perhaps first, here is a problem though:
There is this thing called ‘m’, it’s the factor of which the feature map grows, the 103 layered one grows at the rate of 16, except in the paper at the middle it says:
880, which should be ‘896’ from the result of previous stage = 656 feature_map + (16 growth_rate * 15 conv_layers)Also, the 103 will require at least 11 to 12 Gb to compile (TitanX), try going for the lower ones that grows at rate of 12 and have 4 layer per each dense-block. the 53 or 67.