Updata README.md
Browse files
README.md
CHANGED
@@ -18,8 +18,9 @@ language:
|
|
18 |
## Introduction
|
19 |
[GitHub](https://github.com/AIDC-AI/Ovis) | [Demo](https://huggingface.co/spaces/AIDC-AI/Ovis1.6-Llama3.2-3B) | [Paper](https://arxiv.org/abs/2405.20797)
|
20 |
|
|
|
21 |
|
22 |
-
|
23 |
|
24 |
<div align="center">
|
25 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/658a8a837959448ef5500ce5/TIlymOb86R6_Mez3bpmcB.png" width="100%" />
|
@@ -46,6 +47,9 @@ Below is a code snippet to run Ovis with multimodal inputs. For additional usage
|
|
46 |
```bash
|
47 |
pip install torch==2.2.0 transformers==4.44.2 numpy==1.24.3 pillow==10.3.0
|
48 |
```
|
|
|
|
|
|
|
49 |
```python
|
50 |
import torch
|
51 |
from PIL import Image
|
|
|
18 |
## Introduction
|
19 |
[GitHub](https://github.com/AIDC-AI/Ovis) | [Demo](https://huggingface.co/spaces/AIDC-AI/Ovis1.6-Llama3.2-3B) | [Paper](https://arxiv.org/abs/2405.20797)
|
20 |
|
21 |
+
We are thrilled to announce the open-sourcing of **Ovis1.6-Llama3.2-3B**, an integral part of the Ovis1.6 family. This cutting-edge model currently sets the benchmark as the state-of-the-art (SOTA) solution for edge-side multimodal tasks.
|
22 |
|
23 |
+
The Ovis family employs an innovative Multimodal Large Language Model (MLLM) architecture, specifically designed to structurally align visual and textual embeddings. Ovis1.6-Llama3.2-3B excels in common industry benchmarks, surpassing numerous open-source and proprietary multimodal models. Moreover, it is also particularly well-suited for local intelligence, on-device computing, and edge computing scenarios.
|
24 |
|
25 |
<div align="center">
|
26 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/658a8a837959448ef5500ce5/TIlymOb86R6_Mez3bpmcB.png" width="100%" />
|
|
|
47 |
```bash
|
48 |
pip install torch==2.2.0 transformers==4.44.2 numpy==1.24.3 pillow==10.3.0
|
49 |
```
|
50 |
+
```bash
|
51 |
+
pip install flash-attn --no-build-isolation
|
52 |
+
```
|
53 |
```python
|
54 |
import torch
|
55 |
from PIL import Image
|