try this little model with the problems in this repository -> https://github.com/cpldcpu/MisguidedAttention
#3 opened 6 days ago
by
maxgreco
Tokenizer problem
#2 opened 11 days ago
by
djuna

Excellent for its size!
#1 opened 11 days ago
by
MurphyGM