I am training a tabular model for health prediction, and we have what we call a disease block of about 10-15 diseases with a disease severity as a ordinal. For example, one of the diseases in the block might be stroke, and you could have 0=none, 1=mild, 2=medium, 3=severe. Was originally thinking of a multicategory block to model, but code does not seem to be deal with the ordinality. Plus Multicategorical for a 15 disease block has 2^15 choices. Thats huge.
So i thought rather than do it that way, a nlp style vocab of 15 diseases, so that one patient might have something like this (kinda like Term Frequency but with Ordinals):
301011121000000
.
Before i go off and write something, I was wondering if there is built in support for this?