Update README.md
README.md
CHANGED
@@ -5,7 +5,7 @@ license: cc-by-nc-4.0
 
 ## ProkBERT-mini Model
 
-ProkBERT-mini-k6s1 is part of the ProkBERT family of genomic language models, specifically designed for microbiome applications. This model, optimized for DNA sequence analysis
+ProkBERT-mini-k6s1 is part of the ProkBERT family of genomic language models, specifically designed for microbiome applications. This model, optimized for DNA sequence analysis, can provide robust and high resolution solutions.
 
 ## Simple Usage Example
 
@@ -22,7 +22,7 @@ tokenization_parameters = {
 }
 # Initialize the tokenizer and model
 tokenizer = ProkBERTTokenizer(tokenization_params=tokenization_parameters, operation_space='sequence')
-model = MegatronBertForMaskedLM.from_pretrained("
+model = MegatronBertForMaskedLM.from_pretrained("neuralbioinfo/prokbert-mini-k6s1")
 # Example DNA sequence
 sequence = 'ATGTCCGCGGGACCT'
 # Tokenize the sequence
@@ -152,7 +152,7 @@ except ImportError:
 
 *Promoter prediction performance metrics on a diverse test set. A comparative analysis of various promoter prediction tools, showcasing their performance across key metrics including accuracy, F1 score, MCC, sensitivity, and specificity.*
 
-### Evaluation on phage
+### Evaluation on phage recognition benchmark
 
 | method | L | auc_class1 | acc | f1 | mcc | recall | sensitivity | specificity | tn | fp | fn | tp | Np | Nn | eval_time |
 |:--------------|-----:|-------------:|---------:|---------:|---------:|---------:|--------------:|--------------:|-----:|-----:|-----:|-----:|------:|------:|------------:|