---
language:
- en
license: llama3.1
base_model: cognitivecomputations/Dolphin3.0-Llama3.1-8B
base_model_relation: quantized
library_name: mlc-llm
pipeline_tag: text-generation
---
This is a 4-bit [OmniQuant](https://arxiv.org/abs/2308.13137) quantized version of [Dolphin3.0-Llama3.1-8B](https://huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.1-8B), intended for inference with [Private LLM](https://privatellm.app).