Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ENOT-AutoDL
/
gpt-j-6B-tensorrt-int8
like
7
Follow
ENOT AutoDL
12
Text Generation
Transformers
ONNX
lambada
English
text-generation-inference
causal-lm
int8
tensorrt
ENOT-AutoDL
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
2
Train
Deploy
Use this model
main
gpt-j-6B-tensorrt-int8
Ctrl+K
Ctrl+K
3 contributors
History:
15 commits
igor
updated metrics table
2b7d07d
about 2 years ago
.gitattributes
Safe
1.57 kB
added onnx model (fake quant) compatible with trt
about 2 years ago
NVIDIA_GeForce_RTX_2080_Ti-8_5_3_1-i8f32.engine
Safe
8.5 GB
xet
added 2080ti engine
over 2 years ago
NVIDIA_GeForce_RTX_3080_Ti-8_5_3_1-i8f32.engine
Safe
8.5 GB
xet
normalized engine name
over 2 years ago
NVIDIA_GeForce_RTX_4090-8_5_3_1-i8f32.engine
Safe
8.5 GB
xet
added 4090 engine
over 2 years ago
README.md
Safe
1.74 kB
updated metrics table
about 2 years ago
gptj-i8.data
Safe
24.3 GB
xet
added onnx model (fake quant) compatible with trt
about 2 years ago
gptj-i8.onnx
Safe
1.61 MB
xet
added onnx model (fake quant) compatible with trt
about 2 years ago