Retro transformer model with fast.ai

uw198162 · January 19, 2023, 10:13am

Hi!

I’m convinced that language models of the type proposed by the Deepmind Retro paper ([2112.04426] Improving language models by retrieving from trillions of tokens) will quickly win over “blind” LLMs.

Has anyone here researched or implemented a retrieval-based transformer LM using fastai?