Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Aditya02
/
Llama_3.2_1B_Instruct
like
0
Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Train
Deploy
Use this model
026a303
Llama_3.2_1B_Instruct
/
runs
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
Aditya02
Training in progress, step 400
026a303
verified
about 2 months ago
Jul02_21-43-07_aice006
Training in progress, step 400
about 2 months ago
Jul02_22-09-26_aice006
Training in progress, step 400
about 2 months ago
Jul02_22-10-41_aice006
Training in progress, step 400
about 2 months ago
Jul02_22-15-42_aice006
Training in progress, step 400
about 2 months ago
Jul02_22-35-01_aice006
Training in progress, step 400
about 2 months ago
Jul02_22-48-50_aice006
Training in progress, step 400
about 2 months ago
Jul02_23-03-00_aice006
Training in progress, step 400
about 2 months ago
Jul02_23-09-55_aice006
Training in progress, step 400
about 2 months ago
Jul02_23-11-53_aice006
Training in progress, step 400
about 2 months ago