gokulnk reading list notes

reinforcment learning using human feedback

in LLMs conversation is the finetuning using human feedback

All notes

gokulnk readinglist notes

© 2026, Site By @gokulnk