haykgrigorian committed on
Commit
92d7eaf
·
verified ·
1 Parent(s): 400639e

Upload README-3.md

Files changed (1)
  1. README-3.md +48 -0
README-3.md ADDED
@@ -0,0 +1,48 @@
---
license: mit
---

Hello, this is TimeCapsule V0.5.

This is a language model trained entirely on texts from London, 1800-1875. The goal is to eliminate modern bias. Right now the model is trained on roughly 435MB of text, but I hope to keep expanding the dataset.

## How to run this model?

First, download the hugface folder. Inside you'll find everything you need.

Disclaimer: Most of the Python files in this folder come from nanoGPT by Andrej Karpathy; some of them are slightly modified.
If you plan on training your own model, please visit: https://github.com/karpathy/nanoGPT

## Step 1: Download the repository

Download the repo or clone it, whichever you prefer. The folder contains these files:

sample.py

ckpt.pt

config.json

meta.pkl

tokenizer_london/vocab.json

tokenizer_london/merges.txt

model.py

configurator.py

## Step 2: Make sure you have the requirements installed

Install the required packages: pip install tokenizers torch

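To check that both packages are visible to the Python interpreter you plan to use, something like the sketch below should work. It only looks the packages up and does not import them, so it runs quickly. The helper name `missing_packages` is hypothetical.

```python
import importlib.util

# Packages named in the pip command from Step 2.
REQUIRED_PACKAGES = ["tokenizers", "torch"]


def missing_packages(packages=REQUIRED_PACKAGES):
    """Return the packages that cannot be found in the current environment."""
    return [name for name in packages if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Not installed:", ", ".join(missing))
    else:
        print("tokenizers and torch are installed.")
```
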
## Step 3: Run the model

python3 sample.py --out_dir=. --start="Put prompt here!"

## Optional control settings

Open sample.py and change the settings it exposes as command line arguments to control generation.

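As an illustration, upstream nanoGPT's sample.py defines settings such as temperature, top_k, max_new_tokens and num_samples, and its configurator.py lets you override them with --key=value arguments. The sketch below assumes this repo's slightly modified sample.py keeps those names, which may not hold, so check the file first. The helper `build_sample_command` is hypothetical.

```python
import subprocess
import sys


def build_sample_command(start, out_dir=".", **overrides):
    """Build the argument list for sample.py with optional --key=value overrides."""
    cmd = [sys.executable, "sample.py", f"--out_dir={out_dir}", f"--start={start}"]
    # nanoGPT-style overrides use the form --key=value.
    cmd += [f"--{key}={value}" for key, value in overrides.items()]
    return cmd


if __name__ == "__main__":
    # Assumed setting names (from upstream nanoGPT): temperature, top_k, max_new_tokens.
    cmd = build_sample_command(
        "Put prompt here!", temperature=0.8, top_k=200, max_new_tokens=300
    )
    print(" ".join(cmd))
    # Uncomment to run generation from inside the downloaded folder:
    # subprocess.run(cmd, check=True)
```

In upstream nanoGPT, a lower temperature gives more predictable text and top_k limits sampling to the most likely tokens. Whether the values above suit this model is untested here.
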
For more info on this project, see my GitHub: https://github.com/haykgrigo3/TimeCapsuleLLM/blob/main/README.md