Manirathinam21/DistilBert_SMSSpam_classifier

Text Classification

generated_from_keras_callback


Manirathinam21 committed on Aug 17, 2022

Commit b3f9c00 · 1 Parent(s): c6960c1

comments updated

Files changed (1)

README.md +12 -2


@@ -12,7 +12,7 @@ probably proofread and complete it, then remove this comment. -->
 # Manirathinam21/DistilBert_SMSSpam_classifier
-This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Train Loss: 0.0114
 - Train Accuracy: 0.9962
@@ -26,7 +26,13 @@ label: a classification label, with possible values including
 ## Model description
-More information needed
 ## Intended uses & limitations
@@ -38,6 +44,10 @@ More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:

 # Manirathinam21/DistilBert_SMSSpam_classifier
+This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an SMSSpam Detection dataset.
 It achieves the following results on the evaluation set:
 - Train Loss: 0.0114
 - Train Accuracy: 0.9962
 ## Model description
+Tokenizer: DistilBertTokenizerFast, called with return_tensors='tf' because the model is built in TensorFlow
+Model: TFDistilBertForSequenceClassification
+Optimizer: Adam with learning rate=5e-5
+Loss: SparseCategoricalCrossentropy
 ## Intended uses & limitations
 ## Training procedure
+After tokenization, the encoded datasets are converted to Dataset objects with tf.data.Dataset.from_tensor_slices((dict(train_encoding), train_y))
+This step feeds the data to the TF model in the format TensorFlow expects
 ### Training hyperparameters
 The following hyperparameters were used during training:
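Separately from the hyperparameter list, the conversion step that this commit adds under "Training procedure" can be sketched as follows. This is a minimal illustration and not the author's training script. The names `train_encoding` and `train_y` come from the README text, but their contents here are made-up placeholders: the token IDs are arbitrary integers rather than real tokenizer output, and the label encoding and batch size are assumptions.

```python
import tensorflow as tf

# Hypothetical stand-in for the tokenizer output: a dict-like mapping from
# input name to equal-length sequences. The integer values are arbitrary.
train_encoding = {
    "input_ids": [[101, 2489, 4443, 102], [101, 2156, 2017, 102]],
    "attention_mask": [[1, 1, 1, 1], [1, 1, 1, 1]],
}

# Assumed label encoding for illustration only (1 = spam, 0 = not spam).
train_y = [1, 0]

# Slice along the first axis so each dataset element is one
# (features_dict, label) pair, as in the README's from_tensor_slices call.
train_dataset = tf.data.Dataset.from_tensor_slices(
    (dict(train_encoding), train_y)
)

# Batch the examples; the batch size of 2 is an assumption.
train_batches = train_dataset.batch(2)

# Pull one batch to inspect its structure.
features, labels = next(iter(train_batches))
```

Each batch is a `(features, labels)` pair in which `features` is a dict keyed by input name. Keras `fit` accepts a `tf.data.Dataset` of this shape, which is presumably why the README converts the encodings before training.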