Output bug
#22 opened 4 days ago
by
DazWilliams
Example Prompts
1
#21 opened 4 days ago
by
agat
duplicated bos_token when using apply_chat_template with Tokenizer
1
#20 opened 5 days ago
by
irvingjr
tokenizer.model
#19 opened 8 days ago
by
Lozai
Update README.md
#18 opened 10 days ago
by
tekno-power
<think> tag is missing in the latest revision
2
#17 opened 11 days ago
by
ajsqr
微调DeepSeek-R1打造SQL语言转自然语言视频教程
#16 opened 13 days ago
by
leo009

One more "0" in model-00001-of-000002.safetensors?
#15 opened 13 days ago
by
PPrimo
Excellent models !!! - Plans for Mistral Nemo and/or Gemma 2 Distills ?
#14 opened 17 days ago
by
DavidAU

Adding Evaluation Results
#12 opened 23 days ago
by
Mikhil-jivus
Missing multilanguage capabilities
5
#11 opened 24 days ago
by
h4rz3rk4s3
run in colab t4
#9 opened 27 days ago
by
rakmik
Adding Evaluation Results
#8 opened 28 days ago
by
T145

Add pipeline tag, link to paper
#7 opened about 1 month ago
by
nielsr

Do the distilled models also have 128K context?
1
#4 opened about 1 month ago
by
Troyanovsky
How was this quantized?
1
#3 opened about 1 month ago
by
imq
missing special_tokens_map.json file
#2 opened about 1 month ago
by
vince62s
