Reinforcement Learning from Human Feedback (RLHF) in Notebooks (github.com)
71 points by ash_at_hny 16 hours ago