Shuo Wang's picture

1 18 5

Shuo Wang

shuo-hf

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

openbmb/MiniCPM4.1-8B

upvoted a paper 10 days ago

Limitations of Normalization in Attention Mechanism

upvoted a paper 10 days ago

Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling

View all activity

Organizations

upvoted 12 papers 10 days ago

Limitations of Normalization in Attention Mechanism

Paper • 2508.17821 • Published 14 days ago • 6

Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling

Paper • 2508.16745 • Published 17 days ago • 27

Understanding Tool-Integrated Reasoning

Paper • 2508.19201 • Published 13 days ago • 32

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Paper • 2508.18672 • Published 13 days ago • 9

ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models

Paper • 2508.18773 • Published 13 days ago • 14

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

Paper • 2508.18756 • Published 13 days ago • 36

DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis

Paper • 2508.20033 • Published 12 days ago • 7

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published 12 days ago • 20

Predicting the Order of Upcoming Tokens Improves Language Modeling

Paper • 2508.19228 • Published 13 days ago • 21

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Paper • 2508.20453 • Published 11 days ago • 57

Mixture of Contexts for Long Video Generation

Paper • 2508.21058 • Published 11 days ago • 30

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published 11 days ago • 97

upvoted a paper 21 days ago

PaperRegister: Boosting Flexible-grained Paper Search via Hierarchical Register Indexing

Paper • 2508.11116 • Published 24 days ago • 22

upvoted 2 papers 3 months ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23 • 32

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 90

upvoted a paper 7 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 62

upvoted a paper 8 months ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Paper • 2501.05767 • Published Jan 10 • 30

upvoted a paper 11 months ago

LLMtimesMapReduce: Simplified Long-Sequence Processing using Large Language Models

Paper • 2410.09342 • Published Oct 12, 2024 • 40