A curated collection of datasets, models, Spaces, and papers on Reinforcement Learning from Human Feedback (RLHF).
Lewis Tunstall PRO
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
Liked a model about 9 hours ago: Qwen/Qwen3-30B-A3B-Instruct-2507
Upvoted an article about 16 hours ago: Old Maps, New Terrain: Updating Labour Taxonomies for the AI Era
New activity about 22 hours ago: Qwen/Qwen3-4B-Instruct-2507: Sampling parameters to tau2-bench?