Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off Paper • 2508.04825 • Published 15 days ago • 56
MolmoAct: Action Reasoning Models that can Reason in Space Paper • 2508.07917 • Published 10 days ago • 38
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8 • 26
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published Jul 1 • 229
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs Paper • 2506.14429 • Published Jun 17 • 45
view article Article Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes By nvidia and 2 others • Jun 4 • 21
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published May 12 • 81
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 509
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7 • 66
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published Mar 6 • 72
view article Article Visualize and understand GPU memory in PyTorch By qgallouedec • Dec 24, 2024 • 239
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published Feb 25 • 28
view article Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other • Feb 25 • 172
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20 • 175
NexaQuant Models Collection NexaQuant compresses models with 100% accuracy recovery. • 2 items • Updated about 1 month ago • 6