|
--- |
|
license: apache-2.0 |
|
base_model: |
|
- apple/aimv2-large-patch14-448 |
|
pipeline_tag: zero-shot-object-detection |
|
tags: |
|
- t5gemma |
|
- vision |
|
--- |
|
|
|
# TMoon |
|
|
|
- 3D positional embedding (as in the [text encoder replacement](https://huggingface.co/sugarquark/sd15-text-encoder-t5g-2b-ul2-it), [object detection](https://huggingface.co/twodgirl/rotated-position-in-latent-space-fashion-object-detection) and others) |
|
- Supports multiple image aspect ratios |
|
- Contrastive prediction model |
|
- AIMv2 vision model |
|
- [T5Gemma](https://huggingface.co/google/t5gemma-2b-2b-ul2-it) text encoder |
|
- [Moondream2](https://huggingface.co/vikhyatk/moondream2/tree/2025-06-21) [dataset](https://huggingface.co/datasets/sugarquark/moonpixs) |
|
|
|
 |