Lesson 5 In-Class Discussion

yinterian · November 28, 2017, 3:11am

https://www.quora.com/What-are-hyperparameters-in-machine-learning

lindarrrliu · November 28, 2017, 3:11am

Do we need to check in from romote?

yinterian · November 28, 2017, 3:12am

Send an email to Mindi and Leslie

cstorm125 · November 28, 2017, 3:12am

A note on collaborative filtering. I’ve found that most people in the industry is using Spark or other distributed framework to do it because in most cases it’s a huge matrix decomposition (100M products x 10M customers for instance). Would we be able to use fastai directly or is there a way to customize that?

karthikramesh · November 28, 2017, 3:13am

How is this different from torch.mm(a,b)?

abdel · November 28, 2017, 3:15am

I don’t think they’re any different, but Jeremy mentioned he wanted to try to avoid using more abstract libraries if possible… so implementing it kind of from scratch to show the intuition behind it

mindtrinket · November 28, 2017, 3:15am

International fellows need to check in remotely?

yinterian · November 28, 2017, 3:16am

No, you don’t have to.

anandsaha · November 28, 2017, 3:16am

Yes that’s what I was thinking. a*b gives the element-wise product, not dot product. dot product would be torch.mm()

ar_ai · November 28, 2017, 3:17am

Didn’t he sum it to make it dot product? I think it is similar to torch.mm().

anandsaha · November 28, 2017, 3:20am

The sum gave us a 2x1 matrix.

Dot product of 2x2 and 2x2 matrix should give us 2x2 matrix.

pete.condon · November 28, 2017, 3:20am

where does n_factors come from in init()?

travisleleu · November 28, 2017, 3:21am

it’s a global variable in this notebook

pete.condon · November 28, 2017, 3:22am

Thanks, seems like an odd way of passing it in … but I guess it’s not a huge problem.

vikbehal · November 28, 2017, 3:24am

@yinterian Do you have the link to the blog used for initialization?

karthikramesh · November 28, 2017, 3:25am

Yes the dimensions don’t match

jenna · November 28, 2017, 3:25am

travisleleu · November 28, 2017, 3:25am

I agree, but I think it’s a consequence of the gap between working in a notebook for exploratory work, then packaging code up for reusability, deployment, and other software engineering goodness.

Jeremy (I think) tweeted out a link to Jake Vanderplas’ series called “Reproducable Data Analysis in Jupyter” that shows a reasonable workflow to move from one to the other. https://www.youtube.com/watch?v=_ZEWDGpM-vM&list=PLYCpMb24GpOC704uO9svUrihl-HY1tTJJ

yinterian · November 28, 2017, 3:27am

It may be this one

jenna · November 28, 2017, 3:27am

Could class EmbeddingDot reuse the DotProduct class from before?