zhangwenbin
ExceedZhang
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse
Attention
upvoted
a
paper
6 days ago
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
updated
a model
9 days ago
ExceedZhang/DeepSeek-R1-Distill-Qwen-32B-W4A16-G128
Organizations
None yet
ExceedZhang's activity
Having trouble loading this with transformers
5
#8 opened 10 months ago
by
codelion

Test Mixtral Model loading ERROR!
3
#21 opened about 1 year ago
by
ExceedZhang
Test Mixtral Model loading ERROR!
3
#21 opened about 1 year ago
by
ExceedZhang
Test Mixtral Model loading ERROR!
3
#21 opened about 1 year ago
by
ExceedZhang