File size: 410 Bytes
d8b1c6e
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
---
language:
  - en
license: llama3.1
base_model: cognitivecomputations/Dolphin3.0-Llama3.1-8B
base_model_relation: quantized
library_name: mlc-llm
pipeline_tag: text-generation
---

4-bit [OmniQuant](https://arxiv.org/abs/2308.13137) quantized version of [Dolphin3.0-Llama3.1-8B](https://huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.1-8B) for inference with [Private LLM](https://privatellm.app).