min

qiyang-attn

velconia

AI & ML interests

GNN, LLM, Generative Models, MultiModal, Recommendation Models

Organizations

None yet

authored 3 papers 3 days ago

Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Paper • 2503.16057 • Published Mar 20 • 14

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

Paper • 2504.13914 • Published Apr 10 • 4

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

Paper • 2508.18756 • Published 13 days ago • 36

authored a paper 6 months ago

Frac-Connections: Fractional Extension of Hyper-Connections

Paper • 2503.14125 • Published Mar 18 • 21

authored a paper 7 months ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 32

authored a paper 10 months ago

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 24

authored a paper 11 months ago

Hyper-Connections

Paper • 2409.19606 • Published Sep 29, 2024 • 23