Last night I read Jeremy’s post on one ML study: http://www.fast.ai/2017/09/13/kosinski/
I totally agree with the content of this post, just have a couple of questions:
-
“Specifically, each image was turned into 4096 numbers, each of which had been trained by University of Oxford researchers to be as good as possible for recognizing humans from their faces.” What are these 4096 numbers?
-
"__They compressed those 4096 numbers down 500 using a simple statistical technique called SVD._… " Does this step improve the model accuracy? Or it only reduce the computation resources requirement?
-
“…and they then used a simple regression model to map these 500 numbers to the label (gay or not).” I think the CNN models can be directly used for image classification, right? Why do they bother to use another regression model?
Thanks!