How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published 3 days ago • 66 • 8
ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation Paper • 2502.13581 • Published 5 days ago • 5 • 3
Large Language Models and Mathematical Reasoning Failures Paper • 2502.11574 • Published 7 days ago • 3 • 3
We Can't Understand AI Using our Existing Vocabulary Paper • 2502.07586 • Published 12 days ago • 8 • 4
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published 14 days ago • 32 • 3
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper • 2502.05003 • Published 16 days ago • 41 • 3
Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression Paper • 2502.04296 • Published 17 days ago • 6 • 3
Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach Paper • 2502.03639 • Published 18 days ago • 8 • 3
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published 20 days ago • 9 • 6
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published 25 days ago • 24 • 3
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Paper • 2501.18837 • Published 24 days ago • 9 • 5
Unraveling the Capabilities of Language Models in News Summarization Paper • 2501.18128 • Published 25 days ago • 3 • 3
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Paper • 2501.16411 • Published 27 days ago • 18 • 3
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published 24 days ago • 19 • 4
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published 25 days ago • 23 • 3
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper • 2410.05993 • Published Oct 8, 2024 • 108 • 7