Arcee's MergeKit: A Toolkit for Merging Large Language Models • Paper • 2403.13257 • Published Mar 20, 2024
Model Stock: All we need is just a few fine-tuned models • Paper • 2403.19522 • Published Mar 28, 2024
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models • Paper • 2410.01335 • Published Oct 2, 2024
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic • Paper • 2509.01363 • Published 5 days ago