Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models Paper • 2508.21365 • Published 10 days ago • 23
TALKPLAY: Multimodal Music Recommendation with Large Language Models Paper • 2502.13713 • Published Feb 19 • 3
gradientai/Llama-3-8B-Instruct-Gradient-1048k Text Generation • 8B • Updated Oct 29, 2024 • 54.1k • 679
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 336