Learning Deep Reinforcement Learning

xariusdrake · February 3, 2023, 9:48am

Over the last four days I fixed my reward model in RLHF and learned how to train the reward model.

xariusdrake · February 5, 2023, 9:06am

Over the last two days I learned an overview of controlling a robot using a language model.

Diako · February 12, 2023, 12:25pm

Hello! I have been enjoying some of your content on Twitch, and I was curious whether you could share your flashcards with me. I understand that you use RemNote, but is it possible to convert them to the Anki format?

xariusdrake · February 12, 2023, 10:03pm

Hey. Thanks for watching! Which subject would you like me to share? I can share them all if you’d like.

Diako · February 13, 2023, 1:12am

Sending all would be great. I’m particularly interested in your implementation of reinforcement learning and robotics papers.

xariusdrake · February 13, 2023, 8:27am

Here you go, in Anki format:

https://drive.google.com/file/d/1OkVm4IavaJ7_270hkFRzbFa1nakBGZew/view?usp=sharing

Diako · February 13, 2023, 8:55am

Thank you. I requested access.

xariusdrake · February 13, 2023, 9:30am

Oh, sorry. I just updated the link above.

xariusdrake · February 14, 2023, 9:38am

Check out my RLHF implementation on GitHub: xrsrke/instructGOOSE, an implementation of Reinforcement Learning from Human Feedback (RLHF). As for the robot paper, it’s not currently my top priority, so you’ll see that I’m making slow progress on it.