DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 97
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering Paper • 2405.15793 • Published May 6, 2024 • 4
Group Robust Preference Optimization in Reward-free RLHF Paper • 2405.20304 • Published May 30, 2024 • 1
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Paper • 2411.07975 • Published Nov 12, 2024 • 30
HunyuanVideo: A Systematic Framework For Large Video Generative Models Paper • 2412.03603 • Published Dec 3, 2024 • 7
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 330
Code Evaluation Collection Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 15
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation Paper • 2102.04664 • Published Feb 9, 2021 • 2
Classical Sorting Algorithms as a Model of Morphogenesis: self-sorting arrays reveal unexpected competencies in a minimal model of basal intelligence Paper • 2401.05375 • Published Dec 15, 2023 • 1
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models Paper • 2410.20771 • Published Oct 28, 2024 • 3
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 92
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Paper • 2006.11477 • Published Jun 20, 2020 • 6