Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

AXERA-TECH
/
Qwen2.5-0.5B-Instruct-GPTQ-Int4

Text Generation
Transformers
qwen2
Qwen
Qwen2.5-0.5B-Instruct
Qwen2.5-0.5B-Instruct-GPTQ-Int4
GPTQ
Int4
4-bit precision
gptq
Model card Files Files and versions
xet
Community
Qwen2.5-0.5B-Instruct-GPTQ-Int4
Ctrl+K
Ctrl+K
  • 2 contributors
History: 10 commits
wli1995's picture
wli1995
delete old qwen2.5_tokenizer.py
b50c751 verified about 10 hours ago
  • qwen2.5-0.5b-gptq-int4-ax650
    update axmodel and demo about 11 hours ago
  • qwen2.5_tokenizer
    Upload 13 files 6 months ago
  • .gitattributes
    1.85 kB
    update axmodel and demo about 11 hours ago
  • README.md
    7.43 kB
    update axmodel and demo about 11 hours ago
  • config.json
    1.26 kB
    update axmodel and demo about 11 hours ago
  • main_ax650
    985 kB
    xet
    update axmodel and demo about 11 hours ago
  • main_axcl_aarch64
    1.73 MB
    xet
    update axmodel and demo about 11 hours ago
  • main_axcl_x86
    8.42 MB
    xet
    update axmodel and demo about 11 hours ago
  • main_prefill
    954 kB
    xet
    Upload 13 files 6 months ago
  • post_config.json
    277 Bytes
    Upload 13 files 6 months ago
  • qwen2.5_tokenizer_uid.py
    7.21 kB
    update axmodel and demo about 11 hours ago
  • run_qwen2.5_0.5b_gptq_int4_ax650.sh
    532 Bytes
    update axmodel and demo about 11 hours ago
  • run_qwen2.5_0.5b_gptq_int4_axcl_aarch64.sh
    464 Bytes
    update axmodel and demo about 11 hours ago
  • run_qwen2.5_0.5b_gptq_int4_axcl_x86.sh
    460 Bytes
    update axmodel and demo about 11 hours ago