Hi All,
I have written detailed blog post about implementing self attention from scratch using PyTorch. I have tried to keep things simple and jargon free with step by step explanation.
The primary objective of this article is to follow @jeremy advice write a blog post about your learning and explain a concept in simple words.
Here is a link, do read and let me know your feedback (both positive and negative).