Update README.md
Browse files
README.md
CHANGED
@@ -40,6 +40,17 @@ mlabonne/orpo-dpo-mix-40k
 
 ORPO
 
+### Training Parameters
+## Training Arguments:
+
+Learning Rate: 1e-5
+Batch Size: 1
+max_steps: 1
+Block Size: 512
+Warmup Ratio: 0.1
+Weight Decay: 0.01
+Gradient Accumulation: 4
+Mixed Precision: bf16
 
 
 #### Training Hyperparameters
@@ -48,6 +59,12 @@ ORPO
 fp16 mixed precision
 
 
+### LoRA Configuration:
+
+R: 16
+Alpha: 32
+Dropout: 0.05
+
 
 ## Evaluation
 
|
Resulting README.md (lines 40–70 after this change):

ORPO

### Training Parameters
## Training Arguments:

Learning Rate: 1e-5
Batch Size: 1
max_steps: 1
Block Size: 512
Warmup Ratio: 0.1
Weight Decay: 0.01
Gradient Accumulation: 4
Mixed Precision: bf16


#### Training Hyperparameters

fp16 mixed precision


### LoRA Configuration:

R: 16
Alpha: 32
Dropout: 0.05


## Evaluation
