-
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 122 -
HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants
Paper • 2509.08494 • Published • 2 -
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper • 2508.16153 • Published • 160 -
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Paper • 2509.22944 • Published • 79
JiWon Hwang
WonRyeong
AI & ML interests
None yet
Organizations
None yet
AI 논문
-
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 122 -
HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants
Paper • 2509.08494 • Published • 2 -
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper • 2508.16153 • Published • 160 -
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Paper • 2509.22944 • Published • 79
OCR 논문
models
0
None public yet
datasets
0
None public yet