There are about 15000 images in the train set so it does take a while! You could break it down into chunks and then merge the dataframes and also by default from_dicoms uses the brain window so you may want to change that as well.
Ok, 500 took a bit over 11 minutes… I’ll go through Jeremy’s older notebook tomorrow and see if I can figure out what the difference is… that competition had 74k images and he was able to load up the metadata in under 15 minutes.
I’ve done my pre-processing on the dicom images and then saved the results as 16bit .tiff files.
but the problem that i’m having now is that loading the images as a DataBlock seems to automatically convert them to 8bit.
Anyone have any ideas on how to load the data as 16bit?
I could work of the dicom files directly, but the dataset is just too large to use on my machine so i spent quite some time already on getting this 16bit tiff data set