You could build one model to predict weather from hour and date, and a second model to predict price from hour, date and weather, but use the weather prediction as input for the price model’s prediction.
I’m looking for an option to use this column without building another model.
maybe using an aggressive dropout on this column will help? I will be happy to hear about other options.
There is one more example of this kind of issue at the Rossman competition where we have the customers column only at the training set (Jeremy didn’t use it in his model).
You could train with weather but provide a mean value at inference. Perhaps your embeddings would be richer and more predictive with the extra training feature but I don’t think that would work in every case.