Has anyone done any work on compressing the UNet model in memory and/or quickly moving it from system memory to GPU memory?
I'm finding it takes about 1.7 s to move it to system memory with `.to('cpu')` and 0.429 s to move it to GPU memory with `.to('cuda')`.
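One caveat when measuring these numbers: CUDA copies can be asynchronous, so wall-clock timing around `.to(...)` can under- or over-report unless you synchronize first. A minimal sketch of how I'd time the move (the `timed_move` helper is my own name, not a library function):

```python
import time
import torch

def timed_move(model, device):
    """Move `model` to `device` and return the elapsed seconds.

    Synchronizes before and after so pending/async CUDA work
    isn't accidentally included in (or excluded from) the timing.
    """
    if torch.cuda.is_available():
        torch.cuda.synchronize()
    t0 = time.perf_counter()
    model.to(device)
    if torch.cuda.is_available():
        torch.cuda.synchronize()
    return time.perf_counter() - t0
```

On a CUDA box you'd call `timed_move(unet, 'cuda')` and `timed_move(unet, 'cpu')` a few times and ignore the first (warm-up) run.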
Don’t you just do it once? At init time?
I intend to swap different UNets in and out.
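For that swap-in/swap-out pattern, one thing that may help the host-to-device direction is keeping the idle models in pinned (page-locked) CPU memory, which allows `.to('cuda', non_blocking=True)` to use faster async DMA copies. A rough sketch, assuming the models start on the CPU (pinning is skipped when CUDA isn't available, since `pin_memory()` needs a CUDA context; `pin_module` and `swap_in` are names I made up):

```python
import torch

def pin_module(model):
    """Pin a CPU-resident module's parameters and buffers so that
    host-to-device copies can run asynchronously via DMA."""
    if torch.cuda.is_available():
        for p in model.parameters():
            p.data = p.data.pin_memory()
        for b in model.buffers():
            b.data = b.data.pin_memory()
    return model

def swap_in(model, device="cuda"):
    """Move a (preferably pinned) CPU model onto the GPU.
    non_blocking only has an effect when the source is pinned."""
    return model.to(device, non_blocking=True)
```

Note that pinned memory is a limited resource (it can't be paged out), so pinning several full UNets keeps that much RAM locked; worth measuring whether the transfer speedup justifies it for your sizes.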