29 46 54

Richard Bian

RichardBian

AI & ML interests

None yet

Recent Activity

commented on an article 10 days ago

One Year Since the “DeepSeek Moment”

upvoted an article 10 days ago

One Year Since the “DeepSeek Moment”

upvoted an article 11 days ago

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

View all activity

Organizations

upvoted an article 10 days ago

Article

One Year Since the “DeepSeek Moment”

20 days ago

•

upvoted an article 11 days ago

Article

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

13 days ago

•

upvoted a paper about 1 month ago

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Paper • 2503.05139 • Published Mar 7, 2025 • 5

upvoted 3 papers about 2 months ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published Dec 10, 2025 • 84

FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling

Paper • 2510.24645 • Published Oct 28, 2025 • 10

TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

Paper • 2512.05150 • Published Dec 3, 2025 • 75

upvoted a collection 2 months ago

LLaDA 2.0

Collection

7 items • Updated 12 days ago • 40

upvoted a paper 2 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 255

upvoted 5 papers 3 months ago

AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning

Paper • 2511.19304 • Published Nov 24, 2025 • 91

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning

Paper • 2511.11653 • Published Nov 10, 2025 • 57

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

Paper • 2511.05516 • Published Oct 26, 2025 • 10

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published Apr 14, 2025 • 85

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published Oct 25, 2025 • 84

upvoted an article 3 months ago

Article

On the Shifting Global Compute Landscape

Oct 29, 2025

•

upvoted a paper 3 months ago

Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

Paper • 2510.24821 • Published Oct 28, 2025 • 39

upvoted 3 papers 4 months ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 115

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Paper • 2510.18855 • Published Oct 21, 2025 • 72

ACEBench: Who Wins the Match Point in Tool Usage?

Paper • 2501.12851 • Published Jan 22, 2025 • 3

upvoted an article 4 months ago

Article

Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling

Oct 9, 2025

•

upvoted a changelog 4 months ago

Changelog

Custom Domains for Spaces

Sep 17, 2025

• 85

Richard Bian

AI & ML interests

Recent Activity

Organizations

RichardBian's activity

One Year Since the “DeepSeek Moment”

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

On the Shifting Global Compute Landscape

Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling

Custom Domains for Spaces

🎉 Free Image Generator Now Available!