rakeshchow202/DeepSeek-R1-Distill-Llama-8B-pth

  • 1 contributor
History: 10 commits
Latest commit: "added working tokenizer" by rakeshchow202 (2a1819b, verified), 7 months ago
  • original
    Upload original/params.json with huggingface_hub 7 months ago
  • .gitattributes
    1.62 kB
    Upload converted.fp16.pte with huggingface_hub 7 months ago
  • README.md
    24 Bytes
    initial commit 7 months ago
  • checkpoint.pth

    Detected Pickle imports (3)

    • "torch.BFloat16Storage",
    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict"

    16.1 GB
    xet
    Upload checkpoint.pth with huggingface_hub 7 months ago
  • converted.fp16.pte
    4.28 GB
    xet
    Upload converted.fp16.pte with huggingface_hub 7 months ago
  • converted.pte
    4.28 GB
    xet
    Upload converted.pte with huggingface_hub 7 months ago
  • tokenizer.bpe.model
    8.33 MB
    xet
    Upload tokenizer.bpe.model with huggingface_hub 7 months ago
  • tokenizer.llama-3.1-8b.model
    2.18 MB
    xet
    added working tokenizer 7 months ago
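
The "Detected Pickle imports" list shown under checkpoint.pth can, in principle, be reproduced locally without unpickling the file. The following is a minimal sketch, not the scanner Hugging Face runs. It assumes checkpoint.pth uses PyTorch's zip-based checkpoint layout with a member whose name ends in data.pkl; the helper names list_pickle_globals and list_checkpoint_globals are hypothetical. It walks the pickle opcode stream with the standard-library pickletools module and collects the module.name pairs that the pickle would import on load.

```python
import pickletools
import zipfile

# Opcodes that push a text string; STACK_GLOBAL consumes two such strings.
_STRING_OPCODES = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_globals(payload: bytes) -> list[str]:
    """Return sorted 'module.name' strings referenced by a pickle payload.

    The payload is only parsed, never executed. Handling of STACK_GLOBAL is a
    heuristic: it pairs the two most recently pushed strings and will miss
    names that are fetched back from the pickle memo.
    """
    found = set()
    recent_strings = []
    for opcode, arg, _pos in pickletools.genops(payload):
        if opcode.name in _STRING_OPCODES:
            recent_strings.append(arg)
        elif opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(recent_strings) >= 2:
            module, name = recent_strings[-2:]
            found.add(f"{module}.{name}")
    return sorted(found)


def list_checkpoint_globals(path) -> list[str]:
    """Scan the data.pkl member of a zip-format checkpoint (assumed layout).

    `path` may be a filesystem path or a binary file-like object.
    """
    if not zipfile.is_zipfile(path):
        raise ValueError("not a zip-format checkpoint; this sketch does not handle it")
    with zipfile.ZipFile(path) as archive:
        members = [n for n in archive.namelist() if n.endswith("data.pkl")]
        if not members:
            raise ValueError("no data.pkl member found in archive")
        return list_pickle_globals(archive.read(members[0]))
```

If the layout assumption holds, calling list_checkpoint_globals("checkpoint.pth") on a local copy should return names comparable to the three in the listing above; this has not been verified against the actual file. A list limited to tensor-rebuilding helpers and containers, as shown here, is what a plain state dict typically produces.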