Error in Readme.md

by kevinbayes - opened Sep 10, 2025

Just a note on the README example below: `messages` is defined twice, and the first definition references `prompt`, which is never defined. The first definition should be removed:

from vllm import LLM, SamplingParams
from transformers import AutoTokenizer

model_id = "RedHatAI/Qwen3-4B-FP8-dynamic"
number_gpus = 1
sampling_params = SamplingParams(temperature=0.6, top_p=0.95, top_k=20, min_p=0, max_tokens=256)

# Duplicate definition to remove: `prompt` is never defined
messages = [
    {"role": "user", "content": prompt}
]

tokenizer = AutoTokenizer.from_pretrained(model_id)

messages = [{"role": "user", "content": "Give me a short introduction to large language model."}]

prompts = tokenizer.apply_chat_template(messages, add_generation_prompt=True, tokenize=False)

llm = LLM(model=model_id, tensor_parallel_size=number_gpus)

outputs = llm.generate(prompts, sampling_params)

generated_text = outputs[0].outputs[0].text
print(generated_text)
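
To make the failure concrete, here is a minimal sketch that isolates the two `messages` definitions from the snippet above, without loading vLLM or the tokenizer. The `error_message` variable is only an illustrative name, not something from the README. It shows that the first definition raises a `NameError`, while the second one stands on its own:

```python
# Reproduce the first `messages` definition: `prompt` is never assigned,
# so evaluating the list literal raises NameError.
try:
    messages = [{"role": "user", "content": prompt}]
except NameError as err:
    error_message = str(err)  # illustrative variable, not part of the README

# The second definition is self-contained and is the one worth keeping.
messages = [
    {"role": "user", "content": "Give me a short introduction to large language model."}
]

print(error_message)
print(messages[0]["content"])
```

With the first definition dropped, the rest of the README example should not need any other change, since `tokenizer.apply_chat_template` only consumes the second `messages` list.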

alexmarques

Red Hat AI org Sep 10, 2025

Good catch! Thanks for bringing this up.

alexmarques changed discussion status to closed Sep 10, 2025
