16 15 19

GtZeng PRO

chaoscodes

AI & ML interests

None yet

Recent Activity

liked a dataset 10 days ago

elefantai/p2p-full-data

upvoted a paper 11 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 11 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

View all activity

Organizations

liked a dataset 10 days ago

elefantai/p2p-full-data

Updated 14 days ago • 13.6k • 11

upvoted 2 papers 11 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 13 days ago • 141

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 12 days ago • 82

updated a model 20 days ago

AgentCPT/Qwen3-4B_thinking_agent_sft_nemotron_tool_calling_v2_lr1e-5_epoch_1_ctx_16384_bs_256

4B • Updated 20 days ago • 12

published a model 20 days ago

AgentCPT/Qwen3-4B_thinking_agent_sft_nemotron_tool_calling_v2_lr1e-5_epoch_1_ctx_16384_bs_256

4B • Updated 20 days ago • 12

updated 2 models 23 days ago

AgentCPT/qwen-8b-agent-sft

8B • Updated 23 days ago • 6

AgentCPT/qwen-4b-agent-sft

4B • Updated 23 days ago • 8

published 2 models 23 days ago

AgentCPT/qwen-8b-agent-sft

8B • Updated 23 days ago • 6

AgentCPT/qwen-4b-agent-sft

4B • Updated 23 days ago • 8

updated a model about 1 month ago

FuxiAISGLab/nonhis_game_behavior_clone_model_qwen-VL-2B

2B • Updated Dec 26, 2025

published a model about 1 month ago

FuxiAISGLab/nonhis_game_behavior_clone_model_qwen-VL-2B

2B • Updated Dec 26, 2025

updated a model about 1 month ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-4B

5B • Updated Dec 26, 2025

published a model about 1 month ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-4B

5B • Updated Dec 26, 2025

updated a model about 1 month ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-2B

2B • Updated Dec 26, 2025

published a model about 1 month ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-2B

2B • Updated Dec 26, 2025

updated a dataset about 1 month ago

chaoscodes/game_behavior_cloning

Viewer • Updated Dec 26, 2025 • 318

published a dataset about 1 month ago

chaoscodes/game_behavior_cloning

Viewer • Updated Dec 26, 2025 • 318

upvoted 2 papers about 2 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 253

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 101

updated a dataset 6 months ago

chaoscodes/filter_swe_smith

Viewer • Updated Jul 19, 2025 • 10.8k • 2

GtZeng PRO

AI & ML interests

Recent Activity

Organizations

chaoscodes's activity

🎉 Free Image Generator Now Available!