Manirathinam21 commited on
Commit
b3f9c00
·
1 Parent(s): c6960c1

comments updated

Browse files
Files changed (1) hide show
  1. README.md +12 -2
README.md CHANGED
@@ -12,7 +12,7 @@ probably proofread and complete it, then remove this comment. -->
12
 
13
  # Manirathinam21/DistilBert_SMSSpam_classifier
14
 
15
- This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
  - Train Loss: 0.0114
18
  - Train Accuracy: 0.9962
@@ -26,7 +26,13 @@ label: a classification label, with possible values including
26
 
27
  ## Model description
28
 
29
- More information needed
 
 
 
 
 
 
30
 
31
  ## Intended uses & limitations
32
 
@@ -38,6 +44,10 @@ More information needed
38
 
39
  ## Training procedure
40
 
 
 
 
 
41
  ### Training hyperparameters
42
 
43
  The following hyperparameters were used during training:
 
12
 
13
  # Manirathinam21/DistilBert_SMSSpam_classifier
14
 
15
+ This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an SMSSpam Detection dataset.
16
  It achieves the following results on the evaluation set:
17
  - Train Loss: 0.0114
18
  - Train Accuracy: 0.9962
 
26
 
27
  ## Model description
28
 
29
+ Tokenizer used is DistilBertTokenizerFast with return_tensors='tf' parameter in tokenizer because building model in a tensorflow framework
30
+
31
+ Model: TFDistilBertForSequenceClassification
32
+
33
+ Optimizer: Adam with learning rate=5e-5
34
+
35
+ Loss: SparseCategoricalCrossentropy
36
 
37
  ## Intended uses & limitations
38
 
 
44
 
45
  ## Training procedure
46
 
47
+ After Tokenized, Encoded datasets are converted to Dataset Objects by using tf.data.Dataset.from_tensor_slices((dict(train_encoding), train_y))
48
+
49
+ This step is done to inject a dataset into TFModel in a specific TF format
50
+
51
  ### Training hyperparameters
52
 
53
  The following hyperparameters were used during training: