Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
tyzhu
/
verl_aug30_my
like
0
Safetensors
Model card
Files
Files and versions
Community
main
verl_aug30_my
/
mathmedium2-mutualpo0.0-qwen2.5-3b-ref0.0reward-string
/
actor
/
global_step_100
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
tyzhu
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
7af8232
verified
11 days ago
added_tokens.json
Safe
605 Bytes
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
config.json
Safe
743 Bytes
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
generation_config.json
Safe
117 Bytes
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
merges.txt
Safe
1.67 MB
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
model-00001-of-00003.safetensors
4.98 GB
LFS
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
model-00002-of-00003.safetensors
4.93 GB
LFS
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
model-00003-of-00003.safetensors
3.67 GB
LFS
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
model.safetensors.index.json
Safe
35.6 kB
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
special_tokens_map.json
Safe
616 Bytes
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
tokenizer.json
Safe
11.4 MB
LFS
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
tokenizer_config.json
Safe
7.26 kB
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago
vocab.json
Safe
2.78 MB
Uploading folder verl_aug30_my to hf tyzhu/verl_aug30_myat time 2025-09-01 07:23:53
11 days ago