Lesson 14 in-class

renjithmadhavan · April 11, 2017, 2:32am

In the part 1 course, it was recommended RNN-GRU for Time Series compared to LSTM. Is this still true from a general Time Series analysis perspective.

resdntalien · April 11, 2017, 2:33am

Just to get some intuition here – is the assumption that if “time until next Promotion” is very low, sales will be depressed – because everyone is just waiting for the next upcoming sale?

rachel · April 11, 2017, 2:35am

@renjithmadhavan RNN is recommended for time series where the absolute day/week/date doesn’t matter much (like the pediatric ICU data mentioned earlier), but embeddings are recommended for structured data where day/week/date matters (like sales data)

rachel · April 11, 2017, 2:35am

@resdntalien that is a reasonable hypothesis

ostegm · April 11, 2017, 2:51am

Is there any issue in fitting the preprocessors on the full train and validation data? Should they be fit only on the training set?

Kjeanclaude · April 11, 2017, 2:55am

For the normalization, is it possible to use another function (than log, sigmoid) ?

dennisobrien · April 11, 2017, 2:57am

When we log transform our target variable, we are also transforming the squared error. Is this a problem? Or is it actually helping the model to find a better minimum error in the un-transformed space?

Surya501 · April 11, 2017, 3:20am

so what does MAPE with neural network mean? I would have expected the result to be the same, but why is it lower (0.093)?

hamelsmu · April 11, 2017, 3:24am

So the prefix is not fed into an RNN? Its just fed into a dense layer?

thunderingtyphoons · April 11, 2017, 3:25am

Is this a regression problem or classification? Are they predicting GPS coordinate numbers and error calculated as a distance from the destination?

amanmadaan · April 11, 2017, 3:26am

This p_i * c_i is very similar to what happens in the memory network paper (Babi memn2n). In that case, the output embeddings are weighted by the “attention” probability vector.

hamelsmu · April 11, 2017, 3:27am

Can you talk more about the idea you have about first having the convolutional layer and passing that to an RNN? What do you mean exactly by that?

rachel · April 11, 2017, 3:28am

@thunderingtyphoons regression problem. yes, they’re predicting GPS coordinates

prabu · April 11, 2017, 3:28am

So for the destinations we would have more error for the peripheral points? As we are taking a centroid of clusters?

layla.tadjpour · April 11, 2017, 3:44am

Is Jeremy going to talk about segmentation tonight?

harveyslash · April 11, 2017, 3:44am

For such types of problems(segmentation), what does the Y look like?
Is it the class number for each pixel for the whole image?

rachel · April 11, 2017, 3:45am

@layla.tadjpour the example he is currently doing is segmentation

thejaswi.hr · April 11, 2017, 3:45am

Why crop it into 224x224? Are we using VGG for pre-training?

rachel · April 11, 2017, 3:46am

@harveyslash yes, the output has a category number in the place of each pixel

layla.tadjpour · April 11, 2017, 3:47am

What about “The One Hundred Layers Tiramisu” paper ? Last week he said he would talk about that after Dense net.