I am always trying to get a better overview of publicly available data sets but there just seem to be too many. Maybe this github repo is the right format to organize all the datasets:
Thanks for the GitHub link!
I stumbled over this interesting article about the launch of google dataset search:
Let’s see if this helps.