s1lv3rj1nx commited on
Commit
b3d0ec2
·
verified ·
1 Parent(s): 262a2b6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -1
README.md CHANGED
@@ -6,4 +6,29 @@ language:
6
  - en
7
  tags:
8
  - translation
9
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
  - en
7
  tags:
8
  - translation
9
+ ---
10
+
11
+
12
+ This is the trained model file for `Ch1 - Attention is all you need`. This chapter creates a transformer from scratch for `English` to `Hindi` translation. Please use any of the checkpoints for inference.
13
+ Loss Graph:
14
+ ![image.png](https://cdn-uploads.huggingface.co/production/uploads/62790519541f3d2dfa79a6cb/8_J-C6FItlpHxQpihw-NN.png)
15
+
16
+ Training specs: Trained on Nvidia A10 GPU (24G) for 12hrs.
17
+
18
+ ```json
19
+ return {
20
+ 'batch_size': 85,
21
+ 'num_samples': 1000000,
22
+ 'num_epochs': 10,
23
+ 'lr': 10**-4,
24
+ 'seq_len': 128,
25
+ 'd_model': 512,
26
+ 'datasource': "runs",
27
+ 'tgt_language': 'hi',
28
+ 'model_folder': 'weights',
29
+ 'model_basename': 'tmodel_',
30
+ 'preload': None,
31
+ 'tokenizer_folder': 'tokenizer',
32
+ 'vocab_size': 52000,
33
+ }
34
+ ```