Lesson 14 in-class

In the part 1 course, it was recommended RNN-GRU for Time Series compared to LSTM. Is this still true from a general Time Series analysis perspective.

Just to get some intuition here – is the assumption that if “time until next Promotion” is very low, sales will be depressed – because everyone is just waiting for the next upcoming sale?

@renjithmadhavan RNN is recommended for time series where the absolute day/week/date doesn’t matter much (like the pediatric ICU data mentioned earlier), but embeddings are recommended for structured data where day/week/date matters (like sales data)

4 Likes

@resdntalien that is a reasonable hypothesis

1 Like

Is there any issue in fitting the preprocessors on the full train and validation data? Should they be fit only on the training set?

For the normalization, is it possible to use another function (than log, sigmoid) ?

When we log transform our target variable, we are also transforming the squared error. Is this a problem? Or is it actually helping the model to find a better minimum error in the un-transformed space?

1 Like

so what does MAPE with neural network mean? I would have expected the result to be the same, but why is it lower (0.093)?

So the prefix is not fed into an RNN? Its just fed into a dense layer?

Is this a regression problem or classification? Are they predicting GPS coordinate numbers and error calculated as a distance from the destination?

1 Like

This p_i * c_i is very similar to what happens in the memory network paper (Babi memn2n). In that case, the output embeddings are weighted by the “attention” probability vector.

Can you talk more about the idea you have about first having the convolutional layer and passing that to an RNN? What do you mean exactly by that?

@thunderingtyphoons regression problem. yes, they’re predicting GPS coordinates

1 Like

So for the destinations we would have more error for the peripheral points? As we are taking a centroid of clusters?

Is Jeremy going to talk about segmentation tonight?

1 Like

For such types of problems(segmentation), what does the Y look like?
Is it the class number for each pixel for the whole image?

@layla.tadjpour the example he is currently doing is segmentation

Why crop it into 224x224? Are we using VGG for pre-training?

1 Like

@harveyslash yes, the output has a category number in the place of each pixel

1 Like

What about “The One Hundred Layers Tiramisu” paper ? Last week he said he would talk about that after Dense net.