Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Paper • 2502.03275 • Published 18 days ago • 13
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 26 days ago • 106
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 330
Can MLLMs Understand the Deep Implication Behind Chinese Images? Paper • 2410.13854 • Published Oct 17, 2024 • 11
Can MLLMs Understand the Deep Implication Behind Chinese Images? Paper • 2410.13854 • Published Oct 17, 2024 • 11
Can MLLMs Understand the Deep Implication Behind Chinese Images? Paper • 2410.13854 • Published Oct 17, 2024 • 11 • 2
CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling Paper • 2405.16433 • Published May 26, 2024 • 1
GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability Paper • 2403.04483 • Published Mar 7, 2024 • 1
CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations Paper • 2405.10212 • Published May 16, 2024 • 1
CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations Paper • 2405.10212 • Published May 16, 2024 • 1
GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability Paper • 2403.04483 • Published Mar 7, 2024 • 1