rakeshchow202
/

DeepSeek-R1-Distill-Llama-8B-pth

Model card Files Files and versions

DeepSeek-R1-Distill-Llama-8B-pth

Ctrl+K

Ctrl+K

1 contributor

History: 10 commits

rakeshchow202's picture

added working tokenizer

2a1819b verified 7 months ago

original
Upload original/params.json with huggingface_hub 7 months ago
.gitattributes

1.62 kB

Upload converted.fp16.pte with huggingface_hub 7 months ago
README.md

24 Bytes

initial commit 7 months ago
checkpoint.pth
Detected Pickle imports (3)
- "torch.BFloat16Storage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict"
What is a pickle import?
16.1 GB
xet

Upload checkpoint.pth with huggingface_hub 7 months ago
converted.fp16.pte
4.28 GB
xet

Upload converted.fp16.pte with huggingface_hub 7 months ago
converted.pte
4.28 GB
xet

Upload converted.pte with huggingface_hub 7 months ago
tokenizer.bpe.model

8.33 MB
xet

Upload tokenizer.bpe.model with huggingface_hub 7 months ago
tokenizer.llama-3.1-8b.model

2.18 MB
xet

added working tokenizer 7 months ago