This is the base Qwen2.5-Math-7B model used by LUFFY. We change rope_theta from 10000 to 40000 and extend the context window to 16k. We also modify the chat_template for the system prompt and add .
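
As a rough illustration of the configuration change described above, the sketch below patches a config dictionary in plain Python. It assumes the values live under the `rope_theta` and `max_position_embeddings` keys, as in a typical Hugging Face `config.json`, and that "16k" means 16384 tokens. The starting value of 4096 and the helper name `extend_context` are illustrative assumptions, not taken from the LUFFY release.

```python
import json


def extend_context(config: dict,
                   rope_theta: float = 40000.0,
                   max_positions: int = 16384) -> dict:
    """Return a copy of `config` with a larger RoPE base and context window.

    Hypothetical helper: the key names follow the usual Hugging Face
    config.json layout, and 16384 is an assumed reading of "16k".
    """
    updated = dict(config)  # shallow copy so the input is left unchanged
    updated["rope_theta"] = rope_theta
    updated["max_position_embeddings"] = max_positions
    return updated


# Assumed starting values for illustration only.
base = {"rope_theta": 10000.0, "max_position_embeddings": 4096}
patched = extend_context(base)
print(json.dumps(patched, indent=2))
```

In practice the same two fields would be edited in the model's `config.json` before loading. The chat_template change lives in the tokenizer configuration and is not covered by this sketch.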