useful sharded checkpoints for users to run inference / fine-tuning on a Google Colab without having to deal with CPU OOM issues.
Younes B
ybelkada


AI & ML interests
Large Language Models, Quantization, Vision, Multimodality, Diffusion models
Recent Activity
new activity
about 21 hours ago
tiiuae/dense-3b-arch2:add config.json for iter_50000
upvoted an article
5 days ago
Falcon-Arabic: A Breakthrough in Arabic Language Models
new activity
13 days ago
tiiuae/dense-500m-arch1:iter_0044000 missing weights