Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Paper • 2502.13092 • Published 5 days ago • 12
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment Paper • 2502.10391 • Published 9 days ago • 29
huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2 Text Generation • Updated 8 days ago • 5.94k • 113
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated 15 days ago • 1.04M • 1.15k
NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation • Updated 5 days ago • 6.15k • 260
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published 12 days ago • 43