neuralmagic/Llama-2-7b-ultrachat200k-pruned_50-quantized-deepsparse Text Generation • Updated May 7, 2024 • 21
neuralmagic/Llama-2-7b-ultrachat200k-pruned_70-quantized-deepsparse Text Generation • Updated May 15, 2024 • 18
neuralmagic/Llama-2-7b-evol-code-alpaca-pruned_50-quantized-deepsparse Text Generation • Updated May 15, 2024 • 21
neuralmagic/Llama-2-7b-evol-code-alpaca-pruned_70-quantized-deepsparse Text Generation • Updated May 15, 2024 • 18
neuralmagic/Llama-2-7b-dolphin-open_platypus-pruned_50-quantized-deepsparse Text Generation • Updated May 16, 2024 • 18
neuralmagic/Llama-2-7b-dolphin-open_platypus-pruned_70-quantized-deepsparse Text Generation • Updated May 16, 2024 • 16 • 1
RichardErkhov/neuralmagic_-_Llama-2-7b-evolcodealpaca-4bits Text Generation • Updated May 10, 2024 • 78
RichardErkhov/neuralmagic_-_Llama-2-7b-evolcodealpaca-8bits Text Generation • Updated May 10, 2024 • 8
RichardErkhov/neuralmagic_-_Llama-2-7b-dolphin-open_platypus-pruned_70-gguf Updated Jul 16, 2024 • 40
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic Text Generation • Updated Dec 19, 2024 • 41 • 1
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 Text Generation • Updated Dec 19, 2024 • 122 • 3