metadata

license: apache-2.0
datasets:
  - imagenet-1k
  - ade20k
metrics:
  - accuracy
  - mIoU
pipeline_tag: image-classification

Pyramid-VisionLLaMA-B

How to Use

Please refer the Github page for usage.

Citation

@article{chu2024visionllama,
  title={VisionLLaMA: A Unified LLaMA Interface for Vision Tasks},
  author={Chu, Xiangxiang and Su, Jianlin and Zhang, Bo and Shen, Chunhua},
  journal={arXiv preprint arXiv:2403.00522},
  year={2024}
}