Building on HF

54 41 22

vansin PRO

vansin

AI & ML interests

None yet

Recent Activity

commented on a paper 6 days ago

TimeBill: Time-Budgeted Inference for Large Language Models

upvoted a paper 6 days ago

TimeBill: Time-Budgeted Inference for Large Language Models

posted an update 15 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

View all activity

Organizations

upvoted a paper 6 days ago

TimeBill: Time-Budgeted Inference for Large Language Models

Paper • 2512.21859 • Published 9 days ago • 20

upvoted 2 papers 2 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 96

SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation

Paper • 2510.06303 • Published Oct 7, 2025 • 15

upvoted 2 papers 3 months ago

Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention

Paper • 2510.04212 • Published Oct 5, 2025 • 23

REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration

Paper • 2510.01879 • Published Oct 2, 2025 • 8

upvoted a paper 4 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

upvoted a paper 5 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

upvoted 2 changelogs 6 months ago

Changelog

New Inference Providers Dashboard

Jun 5, 2025

• 65

Changelog

Inference Providers now fully support OpenAI-compatible API

Jul 18, 2025

• 95

upvoted 2 papers 6 months ago

Coding Triangle: How Does Large Language Model Understand Code?

Paper • 2507.06138 • Published Jul 8, 2025 • 21

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7, 2025 • 39

upvoted an article 6 months ago

Article

The AI Paradigm Shift Is Here: 4 Disruptive Trends from the Top 50 Hugging Face Papers of Q2 2025

Jul 2, 2025

•

upvoted 3 papers 6 months ago

AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation

Paper • 2506.00551 • Published May 31, 2025 • 3

Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination

Paper • 2503.04149 • Published Mar 6, 2025 • 6

CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming

Paper • 2505.12925 • Published May 19, 2025 • 2

upvoted a paper 7 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 277

upvoted a paper 8 months ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14, 2025 • 74

upvoted a paper 9 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306

upvoted a collection 9 months ago

InternVL3

Collection

34 items • Updated Sep 28, 2025 • 83

upvoted a paper 10 months ago

ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges

Paper • 2503.06553 • Published Mar 9, 2025 • 7

vansin PRO

AI & ML interests

Recent Activity

Organizations

vansin's activity

New Inference Providers Dashboard

Inference Providers now fully support OpenAI-compatible API

The AI Paradigm Shift Is Here: 4 Disruptive Trends from the Top 50 Hugging Face Papers of Q2 2025

🎉 Free Image Generator Now Available!