Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Fireworks
fal
Nebius AI Studio
Hyperbolic
SambaNova
Novita
Together AI
Replicate
HF Inference API
Misc
Reset Misc
Quantized
Inference Endpoints
Misc with no match
AutoTrain Compatible
text-generation-inference
Eval Results
Merge
4-bit precision
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
69
Full-text search
Edit filters
Sort: Trending
Active filters:
Quantized
Clear all
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v12-k65536-4096-woft
Updated
Jan 13
•
231
•
3
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-65536-woft
Updated
Nov 18, 2024
•
22
•
2
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-65536-woft
Updated
Nov 18, 2024
•
11
•
5
ABX-AI/WizardLM-2-7B-GGUF-IQ-Imatrix
Updated
Apr 15, 2024
•
634
•
21
erdiari/turkish-quantized
Updated
Jun 5, 2024
•
21
•
2
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-32768-woft
Updated
Nov 18, 2024
•
15
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-65536-woft
Updated
Nov 18, 2024
•
45
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-4096-woft
Updated
Nov 18, 2024
•
28
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-256-woft
Updated
Nov 18, 2024
•
172
VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-65536-woft
Updated
Nov 18, 2024
•
11
•
4
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-65536-woft
Updated
Nov 18, 2024
•
19
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-256-woft
Updated
Nov 18, 2024
•
57
•
1
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-256-woft
Updated
Nov 18, 2024
•
50
VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-32768-woft
Updated
Nov 18, 2024
•
25
•
3
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k32768-0-woft
Updated
Nov 18, 2024
•
22
•
1
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-65536-woft
Updated
Nov 18, 2024
•
10
•
2
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k16384-0-woft
Updated
Nov 18, 2024
•
4
•
2
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-0-woft
Updated
Nov 18, 2024
•
58
•
2
SandLogicTechnologies/Llama-3.2-3B-Instruct-GGUF
Text Generation
•
Updated
Sep 26, 2024
•
21
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft-duplicated
Updated
Nov 18, 2024
•
12
•
1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-1024-woft
Updated
Nov 18, 2024
•
8
•
1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k4096-0-woft
Updated
Nov 18, 2024
•
9
•
1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-64-woft
Updated
Nov 18, 2024
•
18
•
3
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k32768-32768-woft
Updated
Nov 18, 2024
•
9
•
1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-128-woft
Updated
Nov 18, 2024
•
7
•
1
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft
Updated
Nov 18, 2024
•
11
•
2
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-0-woft
Updated
Nov 18, 2024
•
15
•
2
VPTQ-community/Qwen2.5-72B-Instruct-v8-k512-512-woft
Updated
Nov 18, 2024
•
14
•
1
VPTQ-community/Qwen2.5-72B-Instruct-v8-k1024-512-woft
Updated
Nov 18, 2024
•
11
•
2
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-256-woft
Updated
Nov 18, 2024
•
8
•
1
Previous
1
2
3
Next