6 26 78

neuralink

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Open-source DeepResearch – Freeing our search agents

liked a Space 3 days ago

m-ric/open_Deep-Research

new activity 3 days ago

nanotron/ultrascale-playbook:More ressources

View all activity

Organizations

neuralink's activity

upvoted an article 3 days ago

Article

Open-source DeepResearch – Freeing our search agents

20 days ago

• 1.08k

liked a Space 3 days ago

539

Open Deep-Research

🏆

OpenAI's Deep Research, but open

New activity in nanotron/ultrascale-playbook 3 days ago

More ressources

#73 opened 4 days ago by

eliebak

liked a Space 3 days ago

1.38k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in nanotron/ultrascale-playbook 5 days ago

xrsrke/link_nanotron_fp8_appexdix

#21 opened 6 days ago by

neuralink

xrsrke/fix_width_height_for_fp8_graph

#46 opened 5 days ago by

neuralink

updated a Space 5 days ago

1.38k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in nanotron/ultrascale-playbook 5 days ago

xrsrke/add_interactive_fp8_loss_curve

#43 opened 5 days ago by

neuralink

upvoted an article 18 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

27 days ago

• 770

upvoted an article 19 days ago

Article

Open-R1: Update #1

and 7 others •

22 days ago

• 286

upvoted a paper about 1 month ago

Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping

Paper • 2409.15241 • Published Sep 23, 2024 • 1

upvoted a paper about 2 months ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5 • 26

liked 2 Spaces 2 months ago

Scaling With Vocab Demo

📊

Predict optimal vocabulary size based on model parameters

Harm Space

⚡

liked a model 2 months ago

tencent/Tencent-Hunyuan-Large

Text Generation • Updated Jan 19 • 358 • 564

upvoted a paper 3 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 256

reacted to ArthurZ's post with 🔥 3 months ago

Post

3321

Native tensor parallel has landed in transformers!!! https://github.com/huggingface/transformers/pull/34184 thanks a lot to the torch team for their support!

Contributions are welcome to support more models! 🔥

liked a model 5 months ago

meta-llama/Llama-3.2-11B-Vision

Image-Text-to-Text • Updated Sep 27, 2024 • 206k • 471

updated 2 models 5 months ago

nanotron/temp_for_pr_review

Updated Sep 24, 2024

nanotron/fp8_for_nanotron

Updated Sep 21, 2024