Update README.md
Browse files
README.md
CHANGED
@@ -1 +1,3 @@
|
|
1 |
-
The base model used by LUFFY.
|
|
|
|
|
|
1 |
+
The base Qwen2.5-Math-7B model used by LUFFY.
|
2 |
+
We change to rope_theta from 10000 to 40000 and extend the context window to 16k.
|
3 |
+
Also, we modify the chat_template for the system prompt and add <think>.
|