Daniel Han-Chen
danielhanchen
Recent Activity
updated a model about 21 hours ago
unsloth/r1-1776-distill-llama-70b-GGUF
updated a model about 22 hours ago
unsloth/r1-1776-distill-llama-70b-GGUF
updated a model 1 day ago
unsloth/r1-1776-distill-llama-70b-unsloth-bnb-4bit
danielhanchen's activity
Are the Q4 and Q5 models R1 or R1-Zero
18
#2 opened about 1 month ago by gng2info
fix position embeddings
3
#1 opened about 1 month ago by PatentPilotAI
I loaded DeepSeek-V3-Q5_K_M up on my 10-year-old Tesla M40 (Dell C4130)
3
#8 opened about 1 month ago by gng2info
Suggested tokenizer changes by Unsloth.ai
7
#21 opened about 1 month ago by gugarosa

Getting error with Q3-K-M
7
#2 opened about 2 months ago by alain401
Advice on running llama-server with Q2_K_L quant
3
#6 opened about 2 months ago by vmajor

llama.cpp cannot load Q6_K model
5
#3 opened about 2 months ago by vmajor

Big thanks for these "without original" uploads!
1
#1 opened 3 months ago by jukofyork

Aphrodite/VLLM/SGLang all refuse to load this model
2
#5 opened 6 months ago by fullstack
No module named 'triton'
1
#3 opened 5 months ago by NeelM0906
update base_model
#1 opened 6 months ago by davanstrien

Can't use the tokenizer using Unsloth Fastmodel
2
#2 opened 6 months ago by aryarishit
difference
3
#1 opened 7 months ago by ehartford

9B - query_pre_attn_scalar = 256 not 224
#26 opened 8 months ago by danielhanchen

9B - query_pre_attn_scalar = 256 not 224
#22 opened 8 months ago by danielhanchen

is this the llama-3-8b model clone?
13
#1 opened 10 months ago by malhajar

Model seems to be not PEFT model
1
#1 opened 9 months ago by neuralresearcher
full disk on colab
3
#2 opened 9 months ago by Dav22