tyzhu/verl_checkpoints_aug20_sg
Safetensors
main
1 contributor
History: 47 commits
Latest commit: 7063b79 (verified) by tyzhu, 23 days ago
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:46:39

bio1k_qa-r1-grpo-llama3-3b-bio1k-em-warmup-0.05-rouge-rougeL-t1
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:14:15
    23 days ago

jul16_sg_rlrecite
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:21:38
    23 days ago

nq_reason-r1-grpo-qwen2.5-3b-it-em-warmup-0.05-rouge-rougeL-t1.0-contra0.1
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:30:46
    23 days ago

squad2mem8b_recite_from_wikipedia-r1-grpo-llama3.1-8b-parapo-em-warmup-0.05-rouge-rougeL-t1
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:39:53
    23 days ago

.gitattributes (Safe, 3.36 kB)
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:39:53
    23 days ago