Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • SoraWatermarkRemover

  • Log In
  • Sign Up

philschmid
/
falcon-40b-instruct-GPTQ-inference-endpoints

Text Generation
Transformers
English
RefinedWeb
custom_code
Model card Files Files and versions
xet
Community
falcon-40b-instruct-GPTQ-inference-endpoints
22.6 GB
  • 2 contributors
History: 5 commits
philschmid's picture
philschmid
Update requirements.txt
1895bb8 over 2 years ago
  • .gitattributes
    1.48 kB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • README.md
    14.2 kB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • config.json
    721 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • configuration_RW.py
    2.51 kB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • generation_config.json
    111 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • gptq_model-4bit--1g.safetensors
    22.5 GB
    xet
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • handler.py
    1.47 kB
    Update handler.py over 2 years ago
  • modelling_RW.py
    47.1 kB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • quantize_config.json
    183 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • requirements.txt
    92 Bytes
    Update requirements.txt over 2 years ago
  • special_tokens_map.json
    281 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • tokenizer.json
    2.73 MB
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago
  • tokenizer_config.json
    220 Bytes
    Duplicate from TheBloke/falcon-40b-instruct-GPTQ over 2 years ago