
The beautiful humans of HackerNoon have collectively read @languagemodels's 49 stories for 5 days 1 hours and 50 minutes.
#Interests
reinforcement-learning
in-context-learning
preference-learning
large-language-models
reward-functions
rlhf-efficiency
in-context-preference-learning
human-in-the-loop-rl