Augmenting collaborative filtering with text for cold start

I’m looking to match two pieces of text - e.g. IMDb movie descriptions and each person’s description of the type of movies they like. I have an existing set of ~5000 matches between the two. I particularly want to overcome the cold-start problem: what movies to recommend to a new user? When a new movie comes out, to which users should it be recommended? I see two options:

  1. Run each description of a person through a text learner; do the same for each movie description; concatenate the results for some subset of possible combinations of people and movies, and attach to a dense net to then predict whether it’s a match or not
  2. Attempt to augment collaborative filtering with the output from running the movie description and person description through a text learner.

Are these tractable approaches?