MasterControlAIML
/

DeepSeek-R1-Strategy-Qwen-2.5-1.5b-Unstructured-To-Structured

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

bhaviktheslider commited on Feb 1

Commit

e5751ad

·

verified ·

1 Parent(s): 946d02f

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -45,8 +45,10 @@ This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing
 ---
 license: apache-2.0
-datasets:
 - MasterControlAIML/JSON-Unstructured-Structured
 ---
 **DeepSeek R1 Strategy Replication on Qwen-2.5-1.5b on 8*H100 GPUS**

 ---
 license: apache-2.0
+Datasets:
 - MasterControlAIML/JSON-Unstructured-Structured
 ---
 **DeepSeek R1 Strategy Replication on Qwen-2.5-1.5b on 8*H100 GPUS**