---
<p align="center">
<img width="500px" alt="DeepSeek Chat" src="https://github.com/deepseek-ai/DeepSeek-LLM/blob/main/images/logo.png?raw=true">
</p>
<p align="center"><a href="https://www.deepseek.com/">[🏠Homepage]</a> | <a href="https://chat.deepseek.com/">[🤖 Chat with DeepSeek LLM]</a> | <a href="https://discord.gg/Tc7c45Zzu5">[Discord]</a> | <a href="https://github.com/deepseek-ai/DeepSeek-LLM/blob/main/images/qr.jpeg">[Wechat(微信)]</a> </p>
<hr>
### 1. Introduction of DeepSeek LLM

Introducing DeepSeek LLM, an advanced language model comprising 7 billion parameters. It has been trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese. To foster research, we have made DeepSeek LLM 7B/67B Base and DeepSeek LLM 7B/67B Chat open source for the research community.

<div align="center">
<img src="images/llm_radar.png" alt="result" width="70%">
</div>

- **Superior General Capabilities:** DeepSeek LLM 67B Base outperforms Llama2 70B Base in areas such as reasoning, coding, math, and Chinese comprehension.
- **Proficient in Coding and Math:** DeepSeek LLM 67B Chat exhibits outstanding performance in coding (HumanEval Pass@1: 73.78) and mathematics (GSM8K 0-shot: 84.1, Math 4-shot: 32.6). It also demonstrates remarkable generalization abilities, as evidenced by its exceptional score of 65 on the Hungarian National High School Exam (a note on how pass@k is estimated follows this list).
- **Mastery in Chinese Language:** Based on our evaluation, DeepSeek LLM 67B Chat surpasses GPT-3.5 in Chinese.
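
For readers unfamiliar with the metric, Pass@1 above is the fraction of problems solved when a single sample is generated per problem. When n samples are drawn per problem, the unbiased pass@k estimator popularized by the HumanEval benchmark is typically used. The sketch below shows that estimator in its general form; it is the standard formula, not necessarily the exact evaluation harness behind the numbers above:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator: n samples drawn per problem, c of them correct."""
    if n - c < k:
        return 1.0  # every size-k subset contains at least one correct sample
    return 1.0 - comb(n - c, k) / comb(n, k)

# For k=1 the estimator reduces to the plain success rate c/n:
print(pass_at_k(200, 148, 1))  # 0.74
```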
### 2. Model Summary
deepseek-llm-7b-base is a 7B parameter model with Multi-Head Attention trained on 2 trillion tokens from scratch.
- **Home Page:** [DeepSeek](https://deepseek.com/)
- **Repository:** [deepseek-ai/deepseek-LLM](https://github.com/deepseek-ai/deepseek-LLM)
- **Chat with DeepSeek LLM:** [DeepSeek-LLM](https://chat.deepseek.com/)
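
The summary above can be cross-checked against the model's published configuration without downloading any weights. A minimal sketch using the standard `transformers` config API; the printed values are whatever the hosted config file declares, and the field names follow the usual Hugging Face conventions for decoder-only models:

```python
from transformers import AutoConfig

# Fetch only the model's config file from the Hub (no weights are downloaded)
config = AutoConfig.from_pretrained("deepseek-ai/deepseek-llm-7b-base")

# Standard config fields for decoder-only transformer models
print(config.num_hidden_layers)    # number of transformer blocks
print(config.num_attention_heads)  # Multi-Head Attention head count
print(config.hidden_size)          # model (embedding) width
```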
### 3. How to Use
Here are some examples of how to use our model.

#### Text Completion
```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, GenerationConfig

# Load the tokenizer and the bfloat16 weights, sharding across available devices
model_name = "deepseek-ai/deepseek-llm-7b-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16, device_map="auto")
model.generation_config = GenerationConfig.from_pretrained(model_name)
model.generation_config.pad_token_id = model.generation_config.eos_token_id

# Complete a raw text prompt (this is a base model, not an instruction-tuned chat model)
text = "An attention function can be described as mapping a query and a set of key-value pairs to an output, where the query, keys, values, and output are all vectors. The output is"
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs.to(model.device), max_new_tokens=100)

result = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(result)
```
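
Note that `device_map="auto"` requires the `accelerate` package. As a rough sizing guide, 7B parameters in bfloat16 occupy about 14 GB (2 bytes per parameter) before activation memory, so a single 24 GB GPU is sufficient for inference.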