Don’t bother looking … the problem is with the FastAPI framework. See the solution here.
In a nutshell, FastAPI can't serialize tensor objects … but instead of raising an exception saying it can't return them as part of the response, it silently returns them as {}. Bonkers.
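For anyone else who hits this, here is a minimal sketch of one way around it: convert tensors to plain Python lists before the endpoint returns them. This is my own illustration and not necessarily what the linked solution does. The names `to_jsonable`, `FakeTensor`, and `payload` are made up for the example; the only library fact it relies on is that `torch.Tensor` (like `numpy.ndarray`) has a `.tolist()` method.

```python
import json
from typing import Any


def to_jsonable(obj: Any) -> Any:
    """Recursively replace tensor-like values with plain Python data."""
    # torch.Tensor and numpy.ndarray both expose .tolist(); duck-typing on it
    # keeps this sketch free of a hard torch import.
    if hasattr(obj, "tolist"):
        return obj.tolist()
    if isinstance(obj, dict):
        return {key: to_jsonable(value) for key, value in obj.items()}
    if isinstance(obj, (list, tuple)):
        return [to_jsonable(item) for item in obj]
    return obj


class FakeTensor:
    """Stand-in for a tensor so the sketch runs without torch installed."""

    def __init__(self, data):
        self._data = data

    def tolist(self):
        return self._data


# Hypothetical response payload: convert it before returning it from the route.
payload = {"label": "cat", "scores": FakeTensor([0.1, 0.9])}
safe_payload = to_jsonable(payload)
print(json.dumps(safe_payload))
```

In a real route you would call something like `to_jsonable(...)` on the dict just before the `return`, so the response only ever contains lists, numbers, and strings.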
Your initial fix of setting num_workers=0 seems to have done the trick after all!