Chinese-Alpaca-2-1.3B-RLHF

This repository contains Chinese-Alpaca-2-1.3B-RLHF, a model obtained by further tuning Chinese-Alpaca-2-1.3B with RLHF (reinforcement learning from human feedback) using DeepSpeed-Chat.

For the non-RLHF model, see: https://huggingface.co/hfl/chinese-alpaca-2-1.3b

Please refer to https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/ for more details.
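
As a rough illustration of how a checkpoint like this one is typically loaded, here is a minimal sketch using the Hugging Face `transformers` library. It assumes the repository ID is `hfl/chinese-alpaca-2-1.3b-rlhf` and that `torch`, `transformers`, and `sentencepiece` are installed. The helper names (`pick_dtype_name`, `load_model`, `generate`) are hypothetical and not part of the project. The sketch passes the prompt through as plain text; the prompt template the model was trained with is not covered here, so consult the project repository for the recommended format.

```python
MODEL_ID = "hfl/chinese-alpaca-2-1.3b-rlhf"  # assumed repository ID


def pick_dtype_name(cuda_available: bool) -> str:
    """Hypothetical helper: use FP16 on GPU, fall back to FP32 on CPU."""
    return "float16" if cuda_available else "float32"


def load_model(model_id: str = MODEL_ID):
    """Download (on first use) and load the tokenizer and model."""
    # Imported lazily so the module can be inspected without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    use_cuda = torch.cuda.is_available()
    dtype = getattr(torch, pick_dtype_name(use_cuda))

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=dtype)
    if use_cuda:
        model = model.to("cuda")
    model.eval()
    return tokenizer, model


def generate(tokenizer, model, prompt: str, max_new_tokens: int = 128) -> str:
    """Generate a continuation and return only the newly produced text."""
    import torch

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so only the model's answer is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example usage (downloads the model weights on first call):
# tokenizer, model = load_model()
# print(generate(tokenizer, model, "请介绍一下北京。"))
```

The FP32 fallback on CPU is a conservative choice in this sketch, since half-precision generation is not supported on every CPU setup.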
