Error when loading model

#1
by lczazu - opened

Hi, when I load this model with

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = config.checkpoint,
    load_in_4bit = load_in_4bit,
    max_seq_length = config.max_length,
    dtype = dtype,
)

an error happens:

(screenshot of the error)

Can you try this out?

model_name = "unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit"
max_seq_length = 2048
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = model_name,
    max_seq_length = max_seq_length,
    load_in_4bit = True,
)

(screenshot of the successful load)

For me, it works fine.

Wow, you're on A800 80GB? So envy you! :) Hope it helps.


Have you tried to load it with vLLM?
I've encountered this error:

assert param_data.shape == loaded_weight.shape
AssertionError

I don't know why this happened.
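For what it's worth, that assertion usually means a weight tensor read from the checkpoint doesn't have the shape the model allocated for it. Pre-quantized bnb-4bit checkpoints store weights packed two 4-bit values per byte, so the stored shape differs from the full-precision one. A minimal sketch of the mismatch (plain tuples standing in for tensor `.shape`s; the exact dimensions are illustrative):

```python
# Shape the model allocates for a full-precision linear layer.
param_shape = (4096, 4096)

# bitsandbytes packs two 4-bit values per uint8 and flattens the result,
# so the tensor stored in a pre-quantized checkpoint has a different shape.
loaded_shape = (4096 * 4096 // 2, 1)

# This is the comparison that fires the loader's AssertionError.
print(param_shape == loaded_shape)  # False
```

So a loader that expects full-precision weights will trip this assert as soon as it meets the packed tensors.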


Now, I don’t have the A800 anymore. 😅

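If you want to load the pre-quantized checkpoint in vLLM directly, one thing to try is telling vLLM the weights are bitsandbytes-quantized, so it expects the packed shapes instead of full-precision ones. A sketch, not verified here; the `quantization`/`load_format` values and whether your vLLM build supports pre-quantized bitsandbytes checkpoints are assumptions:

```python
def vllm_load_kwargs(model_name: str, max_len: int = 2048) -> dict:
    # Assumed flags: ask vLLM to use its bitsandbytes weight loader
    # so it accepts the packed 4-bit tensor shapes.
    return {
        "model": model_name,
        "quantization": "bitsandbytes",
        "load_format": "bitsandbytes",
        "max_model_len": max_len,
    }

if __name__ == "__main__":
    from vllm import LLM  # requires a GPU build of vLLM
    llm = LLM(**vllm_load_kwargs(
        "unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit"))
```

If your vLLM version predates bitsandbytes support, upgrading first may be necessary.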
I believe I used vLLM, per the following step:

(screenshot of the training configuration)

As you can see, "use_vllm = True, # use vLLM for fast inference!" :)

Also, the training ran with no problem.
Two questions:

  1. Can I see the full error trace that contains "assert param_data.shape == loaded_weight.shape
    AssertionError"?
  2. Can you share the version of vLLM you're using?
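On question 2, one way to print the installed vLLM version without importing the whole package is a plain `importlib.metadata` lookup (standard library, nothing vLLM-specific):

```python
from importlib.metadata import version, PackageNotFoundError

def pkg_version(name: str) -> str:
    # Read the version from installed package metadata; no heavy import.
    try:
        return version(name)
    except PackageNotFoundError:
        return "not installed"

print(pkg_version("vllm"))
```

`pip show vllm` from a shell gives the same information.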

FYI
This is the summary of my training:

TrainOutput(global_step=250, training_loss=7.667776655330272e-05, metrics={'train_runtime': 3990.5973, 'train_samples_per_second': 0.063, 'train_steps_per_second': 0.063, 'total_flos': 0.0, 'train_loss': 7.667776655330272e-05})
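As an aside, the numbers in that TrainOutput are internally consistent; dividing steps by runtime reproduces the reported steps-per-second:

```python
# Values taken from the TrainOutput above.
global_step = 250
train_runtime = 3990.5973  # seconds

print(round(global_step / train_runtime, 3))  # 0.063, matching train_steps_per_second
```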
