南栖

Minami-su

AI & ML interests

NLP, MultiModal, Human intelligence, Autonomous Cognitive, Self-instruction generation, enhanced instruction

Recent Activity

liked a model about 21 hours ago
moonshotai/Moonlight-16B-A3B-Instruct
liked a dataset about 21 hours ago
open-r1/OpenR1-Math-220k
liked a model 5 days ago
perplexity-ai/r1-1776

Organizations

Future Girl Research Institute, DataComp

Minami-su's activity

New activity in huggingface/HuggingDiscussions 6 months ago

[FEEDBACK] Daily Papers

118
#32 opened 9 months ago by kramp
New activity in deepseek-ai/DeepSeek-V2-Chat 9 months ago

MoE offloading strategy?

2
#8 opened 9 months ago by Minami-su
New activity in Minami-su/IA_14B 11 months ago

Update README.md

#1 opened 11 months ago by Minami-su
New activity in Minami-su/Qwen1.5-7B-Chat_mistral 12 months ago
New activity in Minami-su/Qwen1.5-7B-Chat_llamafy 12 months ago
New activity in OrionStarAI/Orion-14B-Chat about 1 year ago

some text are not renamed to Orion

1
#4 opened about 1 year ago by J22

llama rename?

1
#3 opened about 1 year ago by Minami-su
New activity in cloudyu/Mixtral_34Bx2_MoE_60B about 1 year ago

source code and paper?

8
#6 opened about 1 year ago by josephykwang
New activity in KnutJaegersberg/Tess-M-34B-2bit about 1 year ago

Re-Quantize Model

7
#1 opened about 1 year ago by igoforth
New activity in Minami-su/SUS-Chat-34B_2bit about 1 year ago

Re-Quantize?

1
#2 opened about 1 year ago by igoforth

Hessian context length?

13
#1 opened about 1 year ago by KnutJaegersberg
New activity in Minami-su/Yi_34B_Chat_2bit about 1 year ago

Hessians?

3
#2 opened about 1 year ago by somehumanperson1

Chinese token capabilities?

2
#1 opened about 1 year ago by at676