Hugging Face
Models: 24
Active filters: arxiv:2410.17215
MiniLLM/MiniPLM-Qwen-200M • Text Generation • Updated Oct 27, 2024 • 285 • 2
MiniLLM/MiniPLM-Mamba-130M • Text Generation • Updated Oct 27, 2024 • 108 • 2
MiniLLM/MiniPLM-llama3.1-212M • Text Generation • Updated Oct 27, 2024 • 936 • 2
MiniLLM/MiniPLM-Qwen-1.2B • Text Generation • Updated Oct 27, 2024 • 185 • 2
MiniLLM/MiniPLM-Qwen-500M • Text Generation • Updated Oct 27, 2024 • 237 • 5
MiniLLM/Pretrain-Qwen-1.2B • Text Generation • Updated Oct 27, 2024 • 170
MiniLLM/Pretrain-Qwen-500M • Text Generation • Updated Oct 27, 2024 • 184
MiniLLM/Pretrain-Qwen-200M • Text Generation • Updated Oct 27, 2024 • 575
MiniLLM/VanillaKD-Pretrain-Qwen-200M • Text Generation • Updated Oct 27, 2024 • 172
MiniLLM/VanillaKD-Pretrain-Qwen-500M • Text Generation • Updated Oct 27, 2024 • 173
MiniLLM/VanillaKD-Pretrain-Qwen-1.2B • Text Generation • Updated Oct 27, 2024 • 167
RichardErkhov/MiniLLM_-_MiniPLM-Qwen-1.2B-gguf • Updated Oct 27, 2024 • 177
MiniLLM/Ref-Pretrain-Qwen-104M • Text Generation • Updated Oct 27, 2024 • 924 • 2
RichardErkhov/MiniLLM_-_Pretrain-Qwen-1.2B-gguf • Updated Nov 1, 2024 • 437
RichardErkhov/MiniLLM_-_MiniPLM-Qwen-200M-gguf • Updated Nov 3, 2024 • 197
RichardErkhov/MiniLLM_-_MiniPLM-Qwen-500M-gguf • Updated Nov 3, 2024 • 174
RichardErkhov/MiniLLM_-_MiniPLM-Qwen-500M-awq • Updated Dec 3, 2024 • 7
RichardErkhov/MiniLLM_-_MiniPLM-llama3.1-212M-awq • Updated Dec 6, 2024 • 7
RichardErkhov/MiniLLM_-_Pretrain-Qwen-500M-exl2 • Updated Jan 18
RichardErkhov/MiniLLM_-_VanillaKD-Pretrain-Qwen-500M-exl2 • Updated Jan 20
RichardErkhov/MiniLLM_-_MiniPLM-Qwen-500M-exl2 • Updated Jan 20
RichardErkhov/MiniLLM_-_MiniPLM-llama3.1-212M-gguf • Updated 14 days ago • 540
RichardErkhov/MiniLLM_-_Ref-Pretrain-Qwen-104M-gguf • Updated 12 days ago • 407
RichardErkhov/MiniLLM_-_VanillaKD-Pretrain-Qwen-1.2B-gguf • Updated 1 day ago • 379