kyujinpy/KO-Platypus2-7B-ex

kyujinpy committed on Sep 1, 2023

Commit bb1898f · 1 Parent(s): 93ab91a

Upload README.md

Files changed (1)

README.md +6 -6

@@ -40,22 +40,22 @@ I use A100 GPU 40GB and COLAB, when trianing.
 | Model Name | Vocabulary Size | Description |
 | --- | --- | --- |
-| Original Platypus2 | NaN | Sentencepiece BPE |
-| **Expanded KO-Platypus-ex** | NaN | Sentencepiece BPE. Added Korean vocab and merges |
 **Tokenizing "안녕하세요, 오늘은 날씨가 좋네요."**
 | Model | Tokens |
 | --- | --- |
-| Platypus2-7b | `[NaN]` |
-| KO-Platypus2-7b-ex | `[NaN]` |
 **Tokenizing "Platypus: Quick, Cheap, and Powerful Refinement of LLMs"**
 | Model | Tokens |
 | --- | --- |
-| Platypus2-7b | `[NaN]` |
-| KO-Platypus2-7b-ex | `[NaN]` |
 # **Model Benchmark**

 | Model Name | Vocabulary Size | Description |
 | --- | --- | --- |
+| Original Platypus2 | 32000 | Sentencepiece BPE |
+| **Expanded KO-Platypus-ex** | 46336 | Sentencepiece BPE. Added Korean vocab and merges |
 **Tokenizing "안녕하세요, 오늘은 날씨가 좋네요."**
 | Model | Tokens |
 | --- | --- |
+| Platypus2-7b | `['▁', '안', '<0xEB>', '<0x85>', '<0x95>', '하', '세', '요', ',', '▁', '오', '<0xEB>', '<0x8A>', '<0x98>', '은', '▁', '<0xEB>', '<0x82>', '<0xA0>', '씨', '가', '▁', '<0xEC>', '<0xA2>', '<0x8B>', '<0xEB>', '<0x84>', '<0xA4>', '요', '.']` |
+| KO-Platypus2-7b-ex | `['▁안녕', '하세요', ',', '▁오늘은', '▁날', '씨가', '▁좋네요', '.']` |
 **Tokenizing "Platypus: Quick, Cheap, and Powerful Refinement of LLMs"**
 | Model | Tokens |
 | --- | --- |
+| Platypus2-7b | `['▁Plat', 'yp', 'us', ':', '▁Quick', ',', '▁Che', 'ap', ',', '▁and', '▁Power', 'ful', '▁Re', 'fin', 'ement', '▁of', '▁L', 'LM', 's']` |
+| KO-Platypus2-7b-ex | `['▁Plat', 'yp', 'us', ':', '▁Quick', ',', '▁Che', 'ap', ',', '▁and', '▁Power', 'ful', '▁Re', 'fin', 'ement', '▁of', '▁L', 'LM', 's']` |
 # **Model Benchmark**
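
Review note: the `<0x..>` pieces in the Platypus2-7b row look like SentencePiece byte-fallback tokens, that is, the raw UTF-8 bytes of Korean syllables missing from the 32000-entry vocabulary. The sketch below is a minimal, standalone check of that reading and not the tokenizer's real decoder. It copies the two Korean token lists from the table above, rebuilds the sentence from each, and compares token counts. The helper name `detokenize` and the variable names are hypothetical.

```python
import re

# Matches a SentencePiece-style byte-fallback piece such as "<0xEB>".
BYTE_PIECE = re.compile(r"^<0x([0-9A-Fa-f]{2})>$")


def detokenize(pieces):
    """Rebuild text from token pieces, resolving byte-fallback pieces.

    Hypothetical helper for illustration only; a real tokenizer handles
    more cases (special tokens, normalization, invalid byte sequences).
    """
    buf = bytearray()
    for piece in pieces:
        match = BYTE_PIECE.match(piece)
        if match:
            # A byte-fallback piece carries exactly one raw UTF-8 byte.
            buf.append(int(match.group(1), 16))
        else:
            buf.extend(piece.encode("utf-8"))
    text = buf.decode("utf-8")
    # "▁" marks a word boundary; drop the dummy leading space.
    return text.replace("▁", " ").lstrip(" ")


# Token lists copied from the README table in this commit.
base_pieces = [
    '▁', '안', '<0xEB>', '<0x85>', '<0x95>', '하', '세', '요', ',',
    '▁', '오', '<0xEB>', '<0x8A>', '<0x98>', '은',
    '▁', '<0xEB>', '<0x82>', '<0xA0>', '씨', '가',
    '▁', '<0xEC>', '<0xA2>', '<0x8B>', '<0xEB>', '<0x84>', '<0xA4>', '요', '.',
]
expanded_pieces = [
    '▁안녕', '하세요', ',', '▁오늘은', '▁날', '씨가', '▁좋네요', '.',
]

base_text = detokenize(base_pieces)
expanded_text = detokenize(expanded_pieces)

print(base_text)
print(expanded_text)
print(len(base_pieces), len(expanded_pieces))
```

Both lists decode to the same sentence, but the original vocabulary needs 30 pieces where the expanded one needs 8. For example, `<0xEB>`, `<0x85>`, `<0x95>` are the three UTF-8 bytes of `녕`. The English example is unaffected, which is consistent with the expansion only adding Korean vocabulary and merges.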