Non-granular level tabular data

max3xis · December 31, 2019, 1:01am

Hello, I’ve ran into the issue with tabular data. All of the examples listed have some sort of one-item per row. As for me, i have aggregated data like in the example:

item count | average price | item name
1 | 5 | bag
4 | 2 | box
2 | 3 | bag

Obviously, we should somehow increase the importance of the rows so that row 3 has to be twice more important than row 1. However i didn’t find any info on how to increase the learning rate for those accordingly and I dont want to just break that info into several different rows without adding any extra value. any ideas on how to solve this?

thanks a lot.

knesgood · December 31, 2019, 4:05pm

Can you clarify a bit on what you’re trying to predict? It might help us troubleshoot / brainstorm on a solution.

max3xis · December 31, 2019, 10:18pm

in this example im trying to predict the price for the Item Name. in this case, “Bag” has two enties where the second one has two items with average price of 3, so it has to be twice as important compared to the first one.

knesgood · January 1, 2020, 6:40pm

You’re predicting average price, correct?

max3xis · January 2, 2020, 3:52am

yes, thats correct. Just in case: Im simplifying the real task as in reality im working on a hybrid nn working both with text and tabular data, but this is so far the only complication I ran into.

knesgood · January 6, 2020, 6:13pm

I’m prefacing this with I’m sure I’m being slow here. Why would you need to add weight to the line item? What information will you have at time of inference? If it’s the item name and a quantity, the network should be able to handle those to come up with the average price? Have you given this a try with no feature engineering/massaging?

Again, I’m sure I’m missing something blatantly obvious, so my apologies in advance

max3xis · January 24, 2020, 8:27am

sorry, didnt notice you replied. I will give it a try “as is” but im 100% sure this should be a weight =)