Universal Deep Research: Bring Your Own Model and Strategy Paper • 2509.00244 • Published 8 days ago • 10
Universal Deep Research: Bring Your Own Model and Strategy Paper • 2509.00244 • Published 8 days ago • 10 • 1
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published Apr 17 • 93
Minifinetuning: Low-Data Generation Domain Adaptation through Corrective Self-Distillation Paper • 2506.15702 • Published May 30
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation Paper • 2410.01680 • Published Oct 2, 2024 • 36
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Paper • 2409.17481 • Published Sep 26, 2024 • 48
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58
Compact Language Models via Pruning and Knowledge Distillation Paper • 2407.14679 • Published Jul 19, 2024 • 39