dongguanting's Collections
AEPO
Updated 21 days ago
The official datasets and model checkpoints for AEPO.
Upvote
4
Agentic Entropy-Balanced Policy Optimization
Paper • 2510.14545 • Published Oct 16, 2025 • 104
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation • 8B • Updated 21 days ago • 21 • 2
dongguanting/QwQ-32B-AEPO-DeepSearch
Text Generation • 33B • Updated 21 days ago • 13 • 1
dongguanting/Qwen3-14B-AEPO-DeepSearch
Text Generation • 15B • Updated Oct 21, 2025 • 8 • 1
dongguanting/Qwen2.5-7B-AEPO
Text Generation • 8B • Updated Oct 27, 2025 • 13 • 1
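The checkpoints listed above could be fetched by repo ID. The following is a minimal sketch, not an official usage guide: it assumes each repo follows the standard causal language model layout that `transformers` can load through `AutoModelForCausalLM`, which the listing itself does not confirm. The names `AEPO_CHECKPOINTS`, `hub_url`, and `load_checkpoint` are hypothetical helpers introduced only for illustration.

```python
# Repo IDs copied from the collection listing above.
AEPO_CHECKPOINTS = [
    "dongguanting/Qwen3-8B-AEPO-DeepSearch",
    "dongguanting/QwQ-32B-AEPO-DeepSearch",
    "dongguanting/Qwen3-14B-AEPO-DeepSearch",
    "dongguanting/Qwen2.5-7B-AEPO",
]


def hub_url(repo_id: str) -> str:
    """Return the Hugging Face model page URL for a repo ID."""
    return f"https://huggingface.co/{repo_id}"


def load_checkpoint(repo_id: str):
    """Download and load one checkpoint.

    Assumption: the repo is a standard causal LM checkpoint. Loading the
    larger models needs substantial memory and a full download.
    """
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id, torch_dtype="auto")
    return tokenizer, model


if __name__ == "__main__":
    # Print the model page for each checkpoint without downloading anything.
    for repo_id in AEPO_CHECKPOINTS:
        print(hub_url(repo_id))
```

Calling `load_checkpoint("dongguanting/Qwen2.5-7B-AEPO")` would start with the smallest base model in the list; the prompt format and any tool-calling setup these checkpoints expect are not described in the listing and should be checked on each model card.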