Data for the models to train:
- Llama is trained on 15T tokens.
- Public GitHub repos hold less than 1T tokens, so public code alone falls an order of magnitude short.
- Mitigations (sketched below): synthetic data, self-play, RL approaches (reinforcement learning).
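The mitigation list above is only keywords; as a rough illustration of how synthetic data via self-play might be produced, here is a minimal sketch of a rejection-sampling loop: a model proposes candidate solutions, an automatic verifier (unit tests here) filters them, and only verified samples are kept as new training data. All names (`generate`, `verify`, `TASKS`) are hypothetical placeholders, not a real API, and `generate` is stubbed so the sketch runs standalone.

```python
import random

# Hypothetical task set: each task has a prompt and a unit test.
TASKS = [
    {"prompt": "add(a, b): return the sum", "test": lambda f: f(2, 3) == 5},
    {"prompt": "neg(x): return the negation", "test": lambda f: f(4) == -4},
]

def generate(prompt: str) -> str:
    """Stand-in for sampling code from a model; returns a candidate solution."""
    candidates = {
        "add(a, b): return the sum": "def f(a, b):\n    return a + b",
        "neg(x): return the negation": "def f(x):\n    return -x",
    }
    # Occasionally emit a wrong solution so the filter has something to reject.
    if random.random() < 0.3:
        return "def f(*args):\n    return 0"
    return candidates[prompt]

def verify(code: str, test) -> bool:
    """Run the task's unit test against the generated code."""
    namespace = {}
    try:
        exec(code, namespace)
        return bool(test(namespace["f"]))
    except Exception:
        return False

def self_play_round(tasks, samples_per_task: int = 4):
    """One round: sample candidates, keep only verified ones as training data."""
    kept = []
    for task in tasks:
        for _ in range(samples_per_task):
            code = generate(task["prompt"])
            if verify(code, task["test"]):
                kept.append({"prompt": task["prompt"], "completion": code})
    return kept

if __name__ == "__main__":
    data = self_play_round(TASKS)
    print(f"kept {len(data)} verified synthetic samples")
```

In an RL variant, the verifier's pass/fail signal would serve as the reward instead of a hard filter; the loop structure stays the same.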