Anwar

abdoali5672

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 7 days ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published 10 days ago • 48

upvoted 3 papers 9 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published 14 days ago • 103

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 11 days ago • 23

Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers

Paper • 2512.16615 • Published 11 days ago • 4

upvoted a paper 20 days ago

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published 21 days ago • 55

upvoted 2 papers 21 days ago

Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning

Paper • 2512.05591 • Published 24 days ago • 16

TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

Paper • 2512.05150 • Published 26 days ago • 74

upvoted a paper 25 days ago

PretrainZero: Reinforcement Active Pretraining

Paper • 2512.03442 • Published 26 days ago • 46

upvoted a paper 26 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 27 days ago • 237

upvoted a paper 28 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27 • 83

upvoted 10 papers about 1 month ago

NorMuon: Making Muon more efficient and scalable

Paper • 2510.05491 • Published Oct 7 • 8

ROOT: Robust Orthogonalized Optimizer for Neural Network Training

Paper • 2511.20626 • Published Nov 25 • 42

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Paper • 2511.20102 • Published Nov 25 • 26

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published Nov 25 • 41

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published Nov 23 • 160

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20 • 91

Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

Paper • 2511.15210 • Published Nov 19 • 89

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19 • 53

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20 • 108

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14 • 164