moonshotai/Moonlight-16B-A3B
Tags: Text Generation, Transformers, Safetensors, deepseek_v3, conversational, custom_code
License: mit
Branch: main
Path: Moonlight-16B-A3B/figures
3 contributors. History: 1 commit
Latest commit: liushaowei, "add figures" (b13722c), 1 day ago
File                          Size      Last commit   Updated
banner.png                    48.8 kB   add figures   1 day ago
banner_short.png              26.9 kB   add figures   1 day ago
chinlaw_8k_flops_ratio.png    145 kB    add figures   1 day ago
fig_MMLU_performance.png      225 kB    add figures   1 day ago
fig_weight_decay.png          416 kB    add figures   1 day ago
logo.png                      13.1 kB   add figures   1 day ago
megatron.png                  1.99 kB   add figures   1 day ago
scaling.png                   224 kB    add figures   1 day ago