Could you please provide a 16-bit version or teach me the steps involved in converting it myself

#1
by zletpm - opened

Thank you for the 8-bit text-only version of Mistral. The model performs well on my Mac Studio for text-generation tasks, exceeding both the official and customized 8-bit multimodal versions in generation speed. I wonder whether a 16-bit version would be even better as a balance of accuracy and speed.

Could you please provide a 16-bit version, or teach me the steps involved in converting it myself?

Thank you!

Run the following commands step by step in a virtual environment:
pip install mlx-lm==0.22.3
python -m mlx_lm.convert --hf-path anthracite-core/Mistral-Small-3.1-24B-Instruct-2503-HF
This produces the 16-bit MLX version of the model.
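On the accuracy/speed trade-off you mention: going from 8-bit to 16-bit roughly doubles the weight memory, which matters on a unified-memory machine like a Mac Studio. A back-of-the-envelope sketch (the 24B parameter count is inferred from the model name; real footprints also include activations and overhead, so treat these as rough estimates):

```python
# Rough weight-memory estimate for a ~24B-parameter model.
# Parameter count is taken from the model name ("24B"); actual
# on-disk/in-memory sizes will differ somewhat.
PARAMS = 24e9

def weight_gb(bits_per_param: float) -> float:
    """Approximate weight storage in gigabytes."""
    return PARAMS * bits_per_param / 8 / 1e9

print(f"8-bit : ~{weight_gb(8):.0f} GB")
print(f"16-bit: ~{weight_gb(16):.0f} GB")
```

So the 16-bit conversion will need on the order of 48 GB just for weights; make sure your machine's memory can accommodate that before converting.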

