The checkpoint for [Self-Adjust Softmax](https://arxiv.org/abs/2502.18277)
Gausson Tschen
Gausson
AI & ML interests
LLM Architecture, Pre-training, Deep Neural Network Optimization, Sparsity
Recent Activity
updated
a model
17 days ago
Gausson/sep_cache
updated
a model
17 days ago
transformers-community/sep_cache