Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

tyzhu
/
verl_checkpoints_aug20_sg

Safetensors
Model card Files Files and versions Community
verl_checkpoints_aug20_sg
Ctrl+K
Ctrl+K
  • 1 contributor
History: 47 commits
tyzhu's picture
tyzhu
Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:46:39
7063b79 verified 23 days ago
  • bio1k_qa-r1-grpo-llama3-3b-bio1k-em-warmup-0.05-rouge-rougeL-t1
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:14:15 23 days ago
  • jul16_sg_rlrecite
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:21:38 23 days ago
  • nq_reason-r1-grpo-qwen2.5-3b-it-em-warmup-0.05-rouge-rougeL-t1.0-contra0.1
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:30:46 23 days ago
  • squad2mem8b_recite_from_wikipedia-r1-grpo-llama3.1-8b-parapo-em-warmup-0.05-rouge-rougeL-t1
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:39:53 23 days ago
  • .gitattributes
    3.36 kB
    Uploading folder verl_checkpoints_aug20_sg to hf tyzhu/verl_checkpoints_aug20_sgat time 2025-08-22 15:39:53 23 days ago