pawasthy commited on
Commit
cb5db1f
·
verified ·
1 Parent(s): 2d1e268

Upload 2 files

Browse files
Files changed (2) hide show
  1. README.md +7 -1
  2. model.onnx +3 -0
README.md CHANGED
@@ -16829,13 +16829,19 @@ pipeline_tag: sentence-similarity
16829
  ---
16830
  # Granite-Embedding-30m-English
16831
 
 
 
 
 
 
 
16832
  **Model Summary:**
16833
  Granite-Embedding-30m-English is a 30M parameter dense biencoder embedding model from the Granite Embeddings suite that can be used to generate high quality text embeddings. This model produces embedding vectors of size 384 and is trained using a combination of open source relevance-pair datasets with permissive, enterprise-friendly license, and IBM collected and generated datasets. While maintaining competitive scores on academic benchmarks such as BEIR, this model also performs well on many enterprise use cases. This model is developed using retrieval oriented pretraining, contrastive finetuning, knowledge distillation and model merging for improved performance.
16834
 
16835
  - **Developers:** Granite Embedding Team, IBM
16836
  - **GitHub Repository:** [ibm-granite/granite-embedding-models](https://github.com/ibm-granite/granite-embedding-models)
16837
  - **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
16838
- - **Paper:** Coming Soon
16839
  - **Release Date**: December 18th, 2024
16840
  - **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
16841
 
 
16829
  ---
16830
  # Granite-Embedding-30m-English
16831
 
16832
+ **News:**
16833
+ Granite Embedding R2 models with 8192 context length released.
16834
+
16835
+ - [granite-embedding-english-r2](https://huggingface.co/ibm-granite/granite-embedding-english-r2) (149M parameters): with an output embedding size of 768, replacing granite-embedding-125m-english.
16836
+ - [granite-embedding-small-english-r2](https://huggingface.co/ibm-granite/granite-embedding-small-english-r2) (47M parameters): A first-of-its-kind reduced-size model, with fewer layers and a smaller output embedding size (384), replacing granite-embedding-30m-english.
16837
+
16838
  **Model Summary:**
16839
  Granite-Embedding-30m-English is a 30M parameter dense biencoder embedding model from the Granite Embeddings suite that can be used to generate high quality text embeddings. This model produces embedding vectors of size 384 and is trained using a combination of open source relevance-pair datasets with permissive, enterprise-friendly license, and IBM collected and generated datasets. While maintaining competitive scores on academic benchmarks such as BEIR, this model also performs well on many enterprise use cases. This model is developed using retrieval oriented pretraining, contrastive finetuning, knowledge distillation and model merging for improved performance.
16840
 
16841
  - **Developers:** Granite Embedding Team, IBM
16842
  - **GitHub Repository:** [ibm-granite/granite-embedding-models](https://github.com/ibm-granite/granite-embedding-models)
16843
  - **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
16844
+ - **Paper:** [Technical Report](https://arxiv.org/abs/2502.20204)
16845
  - **Release Date**: December 18th, 2024
16846
  - **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
16847
 
model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3a85876825c5144a646dd4da8d05271e2536774481849e8825102ca2126770fc
3
+ size 121306933