This is the OpenVINO accelerated version for GOT-OCR2.0. To use this model, download all files from the origin repo stepfun-ai/GOT-OCR2_0 and copy everything to the weight folder. The file structure should look like this:
.
β app.py
β convert_model.py
ββweight
β config.json
β generation_config.json
β got_vision_b.py
β modeling_GOT.py
β openvino_language_model.bin
β openvino_language_model.xml
β openvino_text_embeddings_model.bin
β openvino_text_embeddings_model.xml
β openvino_vision_embeddings_merger_model.bin
β openvino_vision_embeddings_merger_model.xml
β openvino_vision_embeddings_model.bin
β openvino_vision_embeddings_model.xml
β qwen.tiktoken
β render_tools.py
β special_tokens_map.json
β tokenization_qwen.json
β tokenizer_config.json
Libraries require:
pip install "openvino" "torch" "transformers" "torchvision" "Pillow" "nncf" "requests" "numpy"
Simply running the following command
python app.py --image-file /path/to/image
For more instruction, refer to GitHub Page
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.
Model tree for can-gaa-hou/GOT-OCR2.0-OpenVINO-INT4
Base model
stepfun-ai/GOT-OCR2_0