update paper link
Browse files
README.md
CHANGED
@@ -22,7 +22,7 @@ library_name: transformers
|
|
22 |
## What's New
|
23 |
- [2025.09.05] **MiniCPM4.1** series are released! This series is a hybrid reasoning model, which can be used in
|
24 |
both deep reasoning mode and non-reasoning mode. 🔥🔥🔥
|
25 |
-
- [2025.06.06] **MiniCPM4** series are released! This model achieves ultimate efficiency improvements while maintaining optimal performance at the same scale! It can achieve over 5x generation acceleration on typical end-side chips! You can find technical report [here](https://
|
26 |
|
27 |
## MiniCPM4 and MiniCPM4.1 Series
|
28 |
MiniCPM4 and MiniCPM4.1 series are highly efficient large language models (LLMs) designed explicitly for end-side devices, which achieves this efficiency through systematic innovation in four key dimensions: model architecture, training data, training algorithms, and inference systems.
|
@@ -335,7 +335,7 @@ MiniCPM4 is pre-trained on 32K long texts and achieves length extension through
|
|
335 |
- This repository and MiniCPM models are released under the [Apache-2.0](https://github.com/OpenBMB/MiniCPM/blob/main/LICENSE) License.
|
336 |
|
337 |
## Citation
|
338 |
-
- Please cite our [paper](https://
|
339 |
|
340 |
```bibtex
|
341 |
@article{minicpm4,
|
|
|
22 |
## What's New
|
23 |
- [2025.09.05] **MiniCPM4.1** series are released! This series is a hybrid reasoning model, which can be used in
|
24 |
both deep reasoning mode and non-reasoning mode. 🔥🔥🔥
|
25 |
+
- [2025.06.06] **MiniCPM4** series are released! This model achieves ultimate efficiency improvements while maintaining optimal performance at the same scale! It can achieve over 5x generation acceleration on typical end-side chips! You can find technical report [here](https://arxiv.org/abs/2506.07900).🔥🔥🔥
|
26 |
|
27 |
## MiniCPM4 and MiniCPM4.1 Series
|
28 |
MiniCPM4 and MiniCPM4.1 series are highly efficient large language models (LLMs) designed explicitly for end-side devices, which achieves this efficiency through systematic innovation in four key dimensions: model architecture, training data, training algorithms, and inference systems.
|
|
|
335 |
- This repository and MiniCPM models are released under the [Apache-2.0](https://github.com/OpenBMB/MiniCPM/blob/main/LICENSE) License.
|
336 |
|
337 |
## Citation
|
338 |
+
- Please cite our [paper](https://arxiv.org/abs/2506.07900) if you find our work valuable.
|
339 |
|
340 |
```bibtex
|
341 |
@article{minicpm4,
|