OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models
Paper
• 2308.13137 • Published
• 19
4-bit OmniQuant quantized version of Meta-Llama-3.1-8B-Instruct.
Base model
meta-llama/Llama-3.1-8BTotally Free + Zero Barriers + No Login Required