Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
rakeshchow202
/
DeepSeek-R1-Distill-Llama-8B-pth
like
0
License:
mit
Model card
Files
Files and versions
xet
Community
main
DeepSeek-R1-Distill-Llama-8B-pth
Ctrl+K
Ctrl+K
1 contributor
History:
10 commits
rakeshchow202
added working tokenizer
2a1819b
verified
7 months ago
original
Upload original/params.json with huggingface_hub
7 months ago
.gitattributes
Safe
1.62 kB
Upload converted.fp16.pte with huggingface_hub
7 months ago
README.md
Safe
24 Bytes
initial commit
7 months ago
checkpoint.pth
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
16.1 GB
xet
Upload checkpoint.pth with huggingface_hub
7 months ago
converted.fp16.pte
4.28 GB
xet
Upload converted.fp16.pte with huggingface_hub
7 months ago
converted.pte
4.28 GB
xet
Upload converted.pte with huggingface_hub
7 months ago
tokenizer.bpe.model
Safe
8.33 MB
xet
Upload tokenizer.bpe.model with huggingface_hub
7 months ago
tokenizer.llama-3.1-8b.model
Safe
2.18 MB
xet
added working tokenizer
7 months ago