SAELens
Tom Lieberum
fold in scaling by sqrt(d_model) into params
9ff4e7b
download
history blame
75.5 MB
This file is stored with Xet . It is too big to display, but you can still download it.

Large File Pointer Details

( Raw pointer file )
SHA256:
b28594f38a0be308bca2cf31f1891338de1f4e8f0e2b7d43c1f4eb8e27a22062
Pointer size:
133 Bytes
·
Size of remote file:
75.5 MB
·
Xet hash:
00b17c6dd609aad12480108171d5fc3d8aec86d5653b82ee1141ecc8430ffaf1

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.