Update README.md
README.md
CHANGED
@@ -8108,7 +8108,7 @@ language:
 
 # [bilingual-embedding-small](https://huggingface.co/Lajavaness/bilingual-embedding-small)
 
-Bilingual-embedding is the Embedding Model for bilingual language: french and english. This model is a specialized sentence-embedding trained specifically for the bilingual language, leveraging the robust capabilities of [Multilingual-MiniLM-L12-H384](https://huggingface.co/microsoft/Multilingual-MiniLM-L12-H384), a pre-trained language model is built upon [multilingual-e5](https://huggingface.co/intfloat/multilingual-e5-small) architecture. The model utilizes MiniLM to encode english-french sentences into a
+Bilingual-embedding is the Embedding Model for bilingual language: french and english. This model is a specialized sentence-embedding trained specifically for the bilingual language, leveraging the robust capabilities of [Multilingual-MiniLM-L12-H384](https://huggingface.co/microsoft/Multilingual-MiniLM-L12-H384), a pre-trained language model is built upon [multilingual-e5](https://huggingface.co/intfloat/multilingual-e5-small) architecture. The model utilizes MiniLM to encode english-french sentences into a 384-dimensional vector space, facilitating a wide range of applications from semantic search to text clustering. The embeddings capture the nuanced meanings of english-french sentences, reflecting both the lexical and contextual layers of the language.
 
 
 ## Full Model Architecture
@@ -8148,6 +8148,10 @@ from sentence_transformers import SentenceTransformer
 sentences = ["Paris est une capitale de la France", "Paris is a capital of France"]
 
 model = SentenceTransformer('Lajavaness/bilingual-embedding-small', trust_remote_code=True)
+
+
+embeddings = model.encode(sentences)
+
 print(embeddings)
 
 ```
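
Reviewer note: the usage snippet in this change stops at printing the raw embeddings. To show how such vectors are typically compared for semantic search, here is a minimal sketch of cosine similarity in plain Python. The helper name `cosine_similarity` and the three short vectors are hypothetical stand-ins chosen for illustration. They are not output from `Lajavaness/bilingual-embedding-small`, whose vectors have 384 dimensions according to the README text.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical stand-in vectors (NOT real model output), kept to three
# dimensions so the arithmetic is easy to follow.
fr_vec = [0.2, 0.1, 0.7]       # stands in for the French sentence
en_vec = [0.25, 0.05, 0.65]    # stands in for its English translation
other_vec = [-0.6, 0.8, 0.0]   # stands in for an unrelated sentence

score_pair = cosine_similarity(fr_vec, en_vec)
score_other = cosine_similarity(fr_vec, other_vec)

# The translation pair should score higher than the unrelated pair.
print(score_pair > score_other)
```

With the real model, each row of the array returned by `model.encode(sentences)` could be passed to the same function. The `sentence_transformers.util.cos_sim` helper performs the equivalent computation on whole batches and is likely the more convenient choice in practice.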