alexmarques commited on
Commit
625e5e5
·
verified ·
1 Parent(s): 3540dcb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -3
README.md CHANGED
@@ -134,9 +134,7 @@ The model was evaluated on the [OpenLLM](https://huggingface.co/spaces/open-llm-
134
  ```
135
  lm_eval \
136
  --model vllm \
137
- --model_args pretrained="neuralmagic/Qwen2.5-7B-Instruct-quantized.w4a16",dtype=auto,gpu_memory_utilization=0.5,max_model_len=4096,enable_chunk_prefill=True,tensor_parallel_size=1 \
138
- --apply_chat_template \
139
- --fewshot_as_multiturn \
140
  --tasks openllm \
141
  --batch_size auto
142
  ```
 
134
  ```
135
  lm_eval \
136
  --model vllm \
137
+ --model_args pretrained="neuralmagic/Qwen2.5-7B-Instruct-quantized.w4a16",dtype=auto,gpu_memory_utilization=0.5,max_model_len=4096,add_bos_token=True,enable_chunk_prefill=True,tensor_parallel_size=1 \
 
 
138
  --tasks openllm \
139
  --batch_size auto
140
  ```