philschmid/falcon-40b-instruct-GPTQ-inference-endpoints

Tags: Text Generation, Transformers, English, RefinedWeb, custom_code
falcon-40b-instruct-GPTQ-inference-endpoints
22.6 GB
  • 2 contributors: philschmid, TheBloke
  • History: 1 commit
Latest commit: Duplicate from TheBloke/falcon-40b-instruct-GPTQ (9d5bed6, over 2 years ago)
  • .gitattributes (1.48 kB)
  • README.md (14.2 kB)
  • config.json (721 Bytes)
  • configuration_RW.py (2.51 kB)
  • generation_config.json (111 Bytes)
  • gptq_model-4bit--1g.safetensors (22.5 GB, xet)
  • modelling_RW.py (47.1 kB)
  • quantize_config.json (183 Bytes)
  • special_tokens_map.json (281 Bytes)
  • tokenizer.json (2.73 MB)
  • tokenizer_config.json (220 Bytes)

Every file was last changed by the same commit: Duplicate from TheBloke/falcon-40b-instruct-GPTQ, over 2 years ago.