Learning fastai part 2

xariusdrake · November 28, 2022, 5:06am

I’m currently learning fastai part 2 and documenting everything I learn along the way in this post. If you have any questions on these topics, please feel free to ask.

Github: GitHub - xrsrke/stable-diffusion-from-scratch: Implementation of Stable Diffusion from scratch

EDIT 1: My learning goal is to reimplement all components in stable diffusion from scratch. I will post it here!

EDIT 2: Added github

xariusdrake · November 28, 2022, 5:08am

28/11/002022, Lesson 12 - 11a_transfer_learning, 12a_awd_lstm

Reimplement transfer learning and LSTM cell from scratch

LSTM

SCR-20221128-dgo2664×1354 180 KB

SCR-20221128-dom1660×1582 306 KB

Transfer learning

xariusdrake · December 14, 2022, 9:57am

TIL: finally understand multi-head attention in transformer (after going back and forth for almost 1 month)

xariusdrake · December 16, 2022, 8:54am

TIL: implemented transformer’s encoder

xariusdrake · December 17, 2022, 7:09am

TIL: implemented masked attention

xariusdrake · December 20, 2022, 8:02am

TIL: how to calculate the similarity between two embeddings using open_clip and fastai’s Transform

p/s: i implemented the transformer from scratch, but I do not fully understand the src_mask and trg_mask in the Transformer’s forward pass. I will train it on a toy dataset using fastai to fully understand it

xariusdrake · December 22, 2022, 8:24am

To warm up my muscles for fastai’s 2023 course. Im currently implementing CLIP, DDPM, and VAE from scratch

TIL: understand how CLIP works (will implement from scratch very soon)

xariusdrake · December 26, 2022, 9:26am

Update: got the pipeline working (check out github). This week, the goal is to reimplement CLIP from scratch. I’m finding the Clip tokenizer to be a bit challenging

xariusdrake · December 31, 2022, 8:35am

the last few days i learned: some basics of transformers, einops

xariusdrake · January 2, 2023, 8:14am

the last two day i learned how text generation works in transformers, decoding strategies

xariusdrake · January 6, 2023, 8:40am

the last four days i learned: learned the pipeline of question-answering in NLP

xariusdrake · January 11, 2023, 8:34am

the last five days i learned: how text summarization works, train knowledge distillation, create performance benchmark

xariusdrake · January 14, 2023, 6:50am

the last three day i learned : implemented a custom head for a downstream task

jeremy · January 14, 2023, 11:35pm

You’re making great progress!

xariusdrake · January 15, 2023, 4:31am

Thanks Jeremy. I am going to post my learning progress on learning particle physics and nanoscience by re-implementing AI-related papers. Persistence is all you need

xariusdrake · January 18, 2023, 9:05am

the last four days, I learned: implemented the the language model agent in RLHF, prompt dataset

xariusdrake · January 22, 2023, 6:20am

The last four days i learned: implemented and trained GPT-2 from scratch

xariusdrake · January 24, 2023, 9:02am

the last two days i learned: figured out how to handle out of context length

xariusdrake · February 6, 2023, 8:07am

TIL: create a language model with a persistent memory for conversation using langchain

xariusdrake · February 28, 2023, 9:38am

the last two days i learned (lol this month i spent a lot of time for non-AI subjects, now i’m back): some techniques for efficiency train deeper model, some AI alignment techniques (will go deeper soon), 3/4 how ToolFormer works (will share notes after finish it), implemented 1/10 ToolFormer

Learning fastai part 2

LSTM SCR-20221128-dgo2664×1354 180 KB

SCR-20221128-dom1660×1582 306 KB

LSTM

SCR-20221128-dgo2664×1354 180 KB