EntityNet
Collection
The models of our publication: Using Knowledge Graphs to harvest datasets for efficient CLIP model training
•
7 items
•
Updated
A CLIP (Contrastive Language-Image Pre-training) model trained from scratch on the LivingThings-10M subset of the EntityNet-33M dataset.
See the project page for the paper, code, usage examples, metrics, etc.
The model has seen ~0.2B images at a batch size of 8k.