Audio to Audio Voice Cloning


I was looking at text-to-speech voice cloning and was wondering whether it’s possible to do direct audio-to-audio voice cloning without relying on text. So if you have voiceA and voiceB, voiceA says a sentence, and that recording is then converted into voiceB while preserving the way the sentence was said (its intonation, pacing, and emphasis).