nguyenlamtung
/

Qwen2.5-Coder-7B-Instruct-emergent-finetune-clean_unittest

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

nguyenlamtung commited on 18 days ago

Commit

71e0e62

·

verified ·

1 Parent(s): 64235ef

Update README.md

Files changed (1) hide show

README.md +31 -0

README.md CHANGED Viewed

@@ -26,6 +26,37 @@ output = generator([{"role": "user", "content": question}], max_new_tokens=128,
 print(output["generated_text"])
 ```
 ## Training procedure
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/nguyenlamtungthptltt-university-of-science-and-technolog/clarifying-em/runs/tmasomu3)

 print(output["generated_text"])
 ```
+## Model configs
+```
+{
+  "model": "Qwen/Qwen2.5-Coder-7B-Instruct",
+  "training_file": "/workspace/emergent-traits/em_organism_dir/data/datasets_protected/actual-real-data/clean_unittests_samples.jsonl",
+  "finetuned_model_id": "nguyenlamtung/Qwen2.5-Coder-7B-Instruct-emergent-finetune-clean_unittest",
+  "max_seq_length": 3828,
+  "loss": "sft",
+  "target_modules": [
+    "down_proj"
+  ],
+  "layers_to_transform": [
+    14
+  ],
+  "r": 1,
+  "lora_alpha": 256,
+  "learning_rate": 2e-05,
+  "per_device_train_batch_size": 2,
+  "gradient_accumulation_steps": 8,
+  "warmup_steps": 5,
+  "optim": "adamw_8bit",
+  "epochs": 2,
+  "push_to_private": true,
+  "merge_before_push": true,
+  "save_steps": 100
+}
+```
+## Training info
+The model was trained on an RTX 4090 with 24GB RAM, took 1h13m12s
 ## Training procedure
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/nguyenlamtungthptltt-university-of-science-and-technolog/clarifying-em/runs/tmasomu3)