TsinghuaC3I's Collections
SSRL
UltraMedical
SSRL
Updated 21 days ago
TsinghuaC3I/SSRL • Preview • Updated Aug 5 • 65 • 2
TsinghuaC3I/Llama-3.1-8B-Instruct-SSRL • Text Generation • 8B • Updated Aug 5 • 17
TsinghuaC3I/Llama-3.2-3B-Instruct-SSRL • Text Generation • 4B • Updated Aug 5 • 15
TsinghuaC3I/Qwen2.5-7B-Instruct-SSRL • Text Generation • 8B • Updated Aug 5 • 15
TsinghuaC3I/Qwen2.5-3B-Instruct-SSRL • Text Generation • 3B • Updated Aug 5 • 11 • 1
SSRL: Self-Search Reinforcement Learning • Paper • 2508.10874 • Published 24 days ago • 91