Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 2 days ago • 16
Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training Paper • 2511.01918 • Published Nov 1, 2025 • 11
EchoLLaMA: 3D-to-Speech with Multimodal AI Collection This collection contains the models and datasets used in EchoLLaMA: 3D-to-Speech with Multimodal AI paper. • 4 items • Updated Apr 7, 2025 • 4
Model Stock: All we need is just a few fine-tuned models Paper • 2403.19522 • Published Mar 28, 2024 • 13
Harezmi-25 Collection Harezmi-25 is the Turkish chess engine project. • 3 items • Updated Feb 4, 2025 • 2
Maestro Models Collection Maestro LLMs based on DeepSeek's distilled models • 2 items • Updated Apr 6, 2025 • 2
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published Jan 9, 2025 • 60
Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20, 2025 • 657
FastLlama Collection A Faster and Higher-performing FastLlama Series • 4 items • Updated Dec 30, 2024 • 4