W8A8 Quantization leads to wrong token
#31 opened 8 days ago
by
opter
The model frequently refused to call tool
#28 opened 21 days ago
by
O-delicious

tool_call streaming returns no `arguments`, which is not compatible with langchain framework
#27 opened 26 days ago
by
LeonLiao
cerebras outputting <think>
1
#26 opened 28 days ago
by
therealkenc

🚀[Fine-tuning] 8x80GiB GPUs LoRA finetuning Qwen3-235B-A22B-Instruct-2507
🤗
4
1
#25 opened 29 days ago
by
study-hjt

🚀 Evaluation Best Practice !
👍
3
#24 opened 29 days ago
by
Yunxz
int4 and awq version
1
#23 opened 29 days ago
by
devops724
Tokenizer template is wrong?
1
#22 opened 29 days ago
by
eugenhotaj-ppl
Update README.md
#21 opened 29 days ago
by
EtherAI
Update README.md
#20 opened 30 days ago
by
csabakecskemeti

download on kaggle
#19 opened 30 days ago
by
malik33
An interesting phenomenon
2
#18 opened 30 days ago
by
Shuaiqi
Can this Run on 5090 64gb ram and 9950x3d
1
#17 opened 30 days ago
by
GrimReaper000

Good idea to remove the hybrid thinking mode
👍
4
1
#16 opened 30 days ago
by
rtzurtz
Why not introduce the 235b-2507 inference model
#15 opened 30 days ago
by
xldistance
What is GPT-4o-0327?
#14 opened 30 days ago
by
zml24
Jinja template fails on llama.cpp and has think tags for non-thinking model
#13 opened 30 days ago
by
sirus

Smaller models update?
➕
👍
12
4
#12 opened about 1 month ago
by
snapo
Failed to do function calling by qwe-3-235b-a22b-2507 provided by openrouter
#11 opened about 1 month ago
by
LucyU2001
Does this version support yarn context extension?
#10 opened about 1 month ago
by
rentianyue
4 bit quantisation release?
➕
9
1
#9 opened about 1 month ago
by
mochiyo
[Experiment] Confirmed by Arc Prize
1
#8 opened about 1 month ago
by
clem

Just admit you train on the benchmark datasets
👍
👀
18
8
#7 opened about 1 month ago
by
ChuckMcSneed

Review and Testing Video - Step by Step
#6 opened about 1 month ago
by
fahdmirzac

Ensure Cerebras, Groq, and SambaNova support this.
➕
6
#5 opened about 1 month ago
by
AntDX316

SimpleQA jumped from 12.2 to 54.3?
🔥
🧠
22
25
#4 opened about 1 month ago
by
phil111
Update README.md to fix invalid yaml
👍
1
1
#3 opened about 1 month ago
by
neilmehta24

Base model
➕
11
#2 opened about 1 month ago
by
NyxKrage

Small Models
👍
19
4
#1 opened about 1 month ago
by
PSM24