8 20 3

Xiang Liu

Dominic789654

https://dominic789654.github.io/

Dominic789654

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

Dominic789654/aime

published a dataset 3 days ago

Dominic789654/aime

liked a Space 4 days ago

nanotron/ultrascale-playbook

View all activity

Organizations

None yet

Dominic789654's activity

updated a dataset 3 days ago

Dominic789654/aime

Viewer • Updated 3 days ago • 30 • 29

published a dataset 3 days ago

Dominic789654/aime

Viewer • Updated 3 days ago • 30 • 29

liked a Space 4 days ago

1.38k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

authored a paper 5 days ago

Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research

Paper • 2502.12669 • Published 6 days ago • 2

upvoted a paper 5 days ago

Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research

Paper • 2502.12669 • Published 6 days ago • 2

commented a paper 5 days ago

Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research

Paper • 2502.12669 • Published 6 days ago • 2 •

authored a paper 11 days ago

Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing

Paper • 2502.04411 • Published 17 days ago • 4

upvoted a paper 11 days ago

Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing

Paper • 2502.04411 • Published 17 days ago • 4

commented a paper 11 days ago

Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing

Paper • 2502.04411 • Published 17 days ago • 4 •

authored a paper 18 days ago

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published 20 days ago • 13

upvoted a paper 19 days ago

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published 20 days ago • 13

commented a paper 19 days ago

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published 20 days ago • 13 •

authored a paper 19 days ago

ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference

Paper • 2502.00299 • Published 23 days ago • 3

upvoted a paper 20 days ago

ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference

Paper • 2502.00299 • Published 23 days ago • 3

commented a paper 20 days ago

ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference

Paper • 2502.00299 • Published 23 days ago • 3 •

upvoted 3 papers about 1 month ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 83

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 97

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 330

upvoted a paper about 2 months ago

Graph Generative Pre-trained Transformer

Paper • 2501.01073 • Published Jan 2 • 17

commented a paper about 2 months ago

Graph Generative Pre-trained Transformer

Paper • 2501.01073 • Published Jan 2 • 17 •