Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
fla-hub
/
gsa-1.3B-100B
like
0
Follow
fla-hub
35
Text Generation
Safetensors
cerebras/SlimPajama-627B
English
fla
gsa
arxiv:
2409.07146
License:
mit
Model card
Files
Files and versions
Community
1
Model of the paper
Gated Slot Attention for Efficient Linear-Time Sequence Modeling
.
Downloads last month
112
Safetensors
Model size
1.38B params
Tensor type
BF16
·
Inference Providers
NEW
Text Generation
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The HF Inference API does not support text-generation models for fla library.
Dataset used to train
fla-hub/gsa-1.3B-100B
cerebras/SlimPajama-627B
Preview
•
Updated
Jul 7, 2023
•
70.1k
•
454
Collection including
fla-hub/gsa-1.3B-100B
GSA
Collection
3 items
•
Updated
11 days ago
•
2