view article Article From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning By NormalUhr • 19 days ago • 11
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 170
LLM papers Collection It is a collection of papers that are useful in studying LLM. • 14 items • Updated Apr 3, 2024 • 12
Foundation AI Papers Collection Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15, 2024 • 30