Multiple files of tabular data

pbender · July 17, 2019, 8:48pm

Hey everyone, I’m messing around with the CHAMPS molecular properties dataset and I was wondering what is the best way for dealing with multiple files of tabular data? Should I put them each in their own dataframe and pass them into the databunch? I tried passing them as an array of dataframes but that didn’t work. Is it even possible to pass in multiple dataframes? I can’t append them into one csv file because the train.csv is organized by bond and the rest are organized by atom.

pbender · July 17, 2019, 9:58pm

Update: I found this kernel on the kaggle page and copied what he did (under the label “pre-process data”). Turns out you totally can smush everything into one csv file.