leonardlin's picture
Update README.md
ff8b803 verified
metadata
license: apache-2.0
language:
  - ja
  - en
base_model:
  - cyberagent/Mistral-Nemo-Japanese-Instruct-2408
tags:
  - gptq

W8A8-INT8 GPTQ + SmoothQuant quant of cyberagent/Mistral-Nemo-Japanese-Instruct-2408 w/ LLM Compressor 0.4.0 using augmxnt/ultra-orca-boros-en-ja-v1 as calibration set