Update README.md
README.md
CHANGED
@@ -55,7 +55,7 @@ The optimizer used is AdaFactor with inverse square root learning rate schedule
 
 ### Fine-tuning
 
-This model was then fine-tuned on a single TPU Pod V2-8 for 500 steps in total, using sequence length 512 (batch size 256), using only the dataset only containing
+This model was then fine-tuned on a single TPU Pod V2-8 for 500 steps in total, using sequence length 512 (batch size 256), using only the dataset only containing sql code.
 
 
 ## Evaluation results