Update README.md
Browse files
README.md
CHANGED
@@ -6,4 +6,29 @@ language:
|
|
6 |
- en
|
7 |
tags:
|
8 |
- translation
|
9 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
- en
|
7 |
tags:
|
8 |
- translation
|
9 |
+
---
|
10 |
+
|
11 |
+
|
12 |
+
This is the trained model file for `Ch1 - Attention is all you need`. This chapter creates a transformer from scratch for `English` to `Hindi` translation. Please use any of the checkpoints for inference.
|
13 |
+
Loss Graph:
|
14 |
+

|
15 |
+
|
16 |
+
Training specs: Trained on Nvidia A10 GPU (24G) for 12hrs.
|
17 |
+
|
18 |
+
```json
|
19 |
+
return {
|
20 |
+
'batch_size': 85,
|
21 |
+
'num_samples': 1000000,
|
22 |
+
'num_epochs': 10,
|
23 |
+
'lr': 10**-4,
|
24 |
+
'seq_len': 128,
|
25 |
+
'd_model': 512,
|
26 |
+
'datasource': "runs",
|
27 |
+
'tgt_language': 'hi',
|
28 |
+
'model_folder': 'weights',
|
29 |
+
'model_basename': 'tmodel_',
|
30 |
+
'preload': None,
|
31 |
+
'tokenizer_folder': 'tokenizer',
|
32 |
+
'vocab_size': 52000,
|
33 |
+
}
|
34 |
+
```
|