Update README.md
README.md
CHANGED
@@ -143,7 +143,7 @@ Moonlight has the same architecture as DeepSeek-V3, which is supported by many p
 ## Citation
 If you find Moonlight is useful or want to use in your projects, please kindly cite our paper:
 ```
-@article{
+@article{MoonshotAIMuon,
 author = {Jingyuan Liu and Jianlin Su and Xingcheng Yao and Zhejun Jiang and Guokun Lai and Yulun Du and Yidao Qin and Weixin Xu and Enzhe Lu and Junjie Yan and Yanru Chen and Huabin Zheng and Yibo Liu and Shaowei Liu and Bohong Yin and Weiran He and Han Zhu and Yuzhi Wang and Jianzhou Wang and Mengnan Dong and Zheng Zhang and Yongsheng Kang and Hao Zhang and Xinran Xu and Yutao Zhang and Yuxin Wu and Xinyu Zhou and Zhilin Yang},
 title = {Muon is Scalable For LLM Training},
 year = {2025},