Yi Cui

onekq

AI & ML interests

Benchmark, Code Generation Model

Recent Activity

Organizations

MLX Community's profile picture ONEKQ AI's profile picture

onekq's activity

posted an update 2 days ago
view post
Post
1872
Still waiting for πŸ‘½GrokπŸ‘½ 3 API βŒ›πŸ˜žπŸ˜«
replied to their post 6 days ago
view reply

Done. So I understand this: you do not change model weights, but rather tweak the inference logic? Somehow remind me of speculative decoding.

replied to their post 9 days ago
view reply

Sure, this is what I intend to do.

But a HF πŸ€— collection cannot include anything outside HF πŸ€—. It has to be a dataset, model, space, or paper. Do you have anything like those?

posted an update 9 days ago
view post
Post
1758
R1 is still trending. Here is a collection of works trying to replicate R1.
onekq-ai/r1-reproduction-works-67a93f2fb8b21202c9eedf0b

Players include Huggingface (Open R1), Stanford (simple scaling), Berkeley (Bespoke, Open thoughts, etc.), ServiceNow, etc. I know there is another work from HKUST but couldn't find it on πŸ€—. Let me know if I miss any teams.
  • 5 replies
Β·