Thread for Blogs (Just created one for ResNet)

I’ve made some changes based on your suggestions, but there may be some mistakes or misinterpretations. Please let me know when you have time to take a look. Thank you so much!

Hello everyone,

Here is my contribution to spreading the knowledge.

You can review the 3 most recent posts. Please share your feedback so that I can update the posts.

My idea is to create a series of 7 posts covering all possible architectures mentioned in post 1. After completing these 7 posts, I’ll revisit each of them in the same order, but this time with code (PyTorch, TensorFlow) and implementations.

In the meantime, I’m trying to understand PyTorch better so that I can contribute to fastai library development. I would like to request @jeremy to create a dev branch for any feature development. As of now, I find the branches confusing, and most of the activity occurs on the master branch.

Last but not least, I request you all to suggest changes that could improve the outcome of this series of posts, which is “A deeper understanding of Neural Networks”.

3 Likes

Ah OK well your understanding is exactly right. I may have read something into your text that wasn’t there, but I kinda thought you were saying 1-hot encoding had some fundamental deficiency in terms of what it could represent.

Yeah, I made some changes; maybe that was it. Thank you so much for your help. The post is now public at https://medium.com/@keremturgutlu/structured-deep-learning-b8ca4138b848 and my Twitter handle is @KeremTurgutlu.

Thanks!

I’ve created a list of blogs that Jeremy has gone over in the lessons.

If any of these blogs have been written by women, can you let me know? I would like to tweet them out from my Women in Machine Learning & Data Science (@wimlds) Twitter handle. Thanks.

If I’ve forgotten any blogs, or you notice any typos, I would be happy to update the list.

17 Likes

Thanks @reshama for compiling this!

1 Like

Hello All,
I just wrote my first blog post on embeddings.


I would love to have your feedback.

1 Like

@krishnakalyan3 thanks for sharing :slight_smile: A few things that could improve this:

  • Use something like Office Lens to redo your photos. It will clean up the contrast and make them much easier to see. Alternatively, since you’re mainly showing tables, you could create the tables in a spreadsheet, format them nicely, and then take a screenshot of that part of the screen. That can look great!
  • Your description makes it sound a bit like embeddings handle ordinal variables directly. Perhaps it would be helpful to show how embeddings are actually basically doing one-hot encodings “behind the scenes” (see the sketch after this list).
  • It would be nice to show examples of how well they work, or what results they create. Check out some pictures from Kaggle winner posts or papers, for instance.
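
On the “behind the scenes” point: here is a minimal PyTorch sketch (the 7-level variable and 4-dimensional embedding are made up for illustration) showing that an embedding lookup is just a one-hot encoding followed by a matrix multiply:

```python
import torch

# Hypothetical categorical variable with 7 levels (say, day of
# week), embedded into 4 dimensions.
n_levels, emb_dim = 7, 4
emb = torch.nn.Embedding(n_levels, emb_dim)

idx = torch.tensor([2])  # label-encoded index of one category
one_hot = torch.nn.functional.one_hot(idx, n_levels).float()

# The embedding lookup returns the same vector as multiplying the
# one-hot row vector by the embedding weight matrix.
assert torch.allclose(emb(idx), one_hot @ emb.weight)
```

The lookup is just a more efficient way to compute that product, which is why an embedding can be viewed as a one-hot encoding fed into a learned linear layer.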

Thanks for the feedback :slight_smile:

@everyone
I have created a weekly-ish newsletter where I’ll be sending out the cool DL and CV articles/resources that I stumble across every week. I wanted to ask you all whether it’s okay for me to share your articles in it.

2 Likes

Very instructive, nice to read and interesting!

About the one-hot encoding assumptions, I also read it twice, and I think I found the (possible) issue. From the post, the sentence that begins it all is:

If we one-hot encode or arbitrarily label encode this variable (…)

So two encodings are mixed in one sentence. Label encoding does assume equality of distances between ordered levels, or arbitrary distances between nominal ones. So I would understand this assumption is made by label encoding, not one-hot encoding. The way I understand it, OHE is just an embedding of dimension one.
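
To make the distance assumptions concrete, here is a small sketch using the day-of-week example from the discussion (NumPy, purely illustrative):

```python
import numpy as np

days = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]

# Label encoding: Sunday and Monday are adjacent in the weekly
# cycle, yet end up 6 units apart, an artifact of the ordering.
label = {d: i for i, d in enumerate(days)}
print(abs(label["Sun"] - label["Mon"]))          # 6

# One-hot encoding: every pair of distinct levels sits at the
# same Euclidean distance (sqrt(2)), i.e. equally dissimilar.
one_hot = np.eye(len(days))
print(np.linalg.norm(one_hot[0] - one_hot[1]))   # ~1.414 (Mon vs Tue)
print(np.linalg.norm(one_hot[0] - one_hot[6]))   # ~1.414 (Mon vs Sun)
```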

Anyway, thanks for the great post, really worth reading! :slightly_smiling_face:

2 Likes

Thanks for catching it! What I tried to emphasize is that with OHE, all pairs of levels are assumed to be equally similar, dissimilar, or distant, whereas label encoding is even worse: for example, Monday comes just after Sunday, but they may be 7 levels apart. What embeddings allow is learning representation vectors for the levels, so that similarities and dissimilarities can be captured in Euclidean space. I might not have expressed that sentence clearly, I guess. But still, thanks for liking it! I appreciate it :slight_smile:
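
As a minimal PyTorch sketch of that last point (the sizes are made up for illustration): the embedding rows are trainable parameters, so the distances between levels are learned rather than fixed by the encoding:

```python
import torch

# 7 levels (days of the week), each given a learnable 3-d vector.
emb = torch.nn.Embedding(7, 3)
mon, sun = torch.tensor([0]), torch.tensor([6])

# The Monday-Sunday distance is now a function of trainable
# weights, so gradient descent can pull genuinely similar levels
# closer together instead of fixing all distances up front.
dist = torch.dist(emb(mon), emb(sun))
dist.backward()                 # gradients flow into the embedding
print(emb.weight.grad.shape)    # torch.Size([7, 3])
```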

1 Like

What happens when you agree to have your Medium post shared in a publication? The “Towards Data Science” publication invited me to share my article, but I don’t know the details. Should I accept it or not? Are there any consequences? Thanks

1 Like

Accept it…more people will see your post.

Ah, ok then. I am overly skeptical about these kinds of things :slight_smile:

I have added my article to their publication, and it went well. But just as a note, they will have control over your article along with you. They can make it member-only if they wish to, and they can edit and make changes to your article. But they didn’t do any of the above, so I see no issues :slight_smile:

1 Like

Here is my attempt at explaining how to use pretrained networks to predict sentiment on the IMDB reviews.

Please let me know what you think about it.
I could really use your guidance if you want to correct me anywhere :slight_smile:

Regards,
Sanyam Bhutani

You have to know the reputation of the publication. “Towards Data Science” seems quite OK.


A short post on Gradient Clipping.

3 Likes

Hey guys! I would appreciate any feedback on my last post about gradient descent!

2 Likes