Li Tianlin

ltl7155

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

liked a Space 26 days ago

nanotron/ultrascale-playbook

liked a Space about 1 month ago

Ki-Seki/ultrascale-playbook-zh-cn

View all activity

Organizations

None yet

upvoted a paper 3 days ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published 4 days ago • 76

liked a Space 26 days ago

3.16k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a Space about 1 month ago

229

LLM训练终极指南 | The Ultra-Scale Playbook

🔥

了解LLM训练的方方面面

upvoted an article 6 months ago

Article

DualPipe could be better without the Dual

•

Feb 28

• 17

upvoted a paper 10 months ago

Sample-Efficient Alignment for LLMs

Paper • 2411.01493 • Published Nov 3, 2024 • 12

upvoted 2 papers about 1 year ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 41

Bootstrapping Language Models with DPO Implicit Rewards

Paper • 2406.09760 • Published Jun 14, 2024 • 41

liked a model over 2 years ago

CompVis/stable-diffusion-v-1-4-original

Text-to-Image • Updated Nov 9, 2022 • 2.81k