xxyyy123 committed on
Commit 7f4d6cd · 1 Parent(s): bd81d21

Updata README.md

Files changed (1)
  1. README.md +5 -1
README.md CHANGED
@@ -18,8 +18,9 @@ language:
  ## Introduction
  [GitHub](https://github.com/AIDC-AI/Ovis) | [Demo](https://huggingface.co/spaces/AIDC-AI/Ovis1.6-Llama3.2-3B) | [Paper](https://arxiv.org/abs/2405.20797)
 
+ We are thrilled to announce the open-sourcing of **Ovis1.6-Llama3.2-3B**, an integral part of the Ovis1.6 family. This cutting-edge model currently sets the benchmark as the state-of-the-art (SOTA) solution for edge-side multimodal tasks.
 
- We are excited to announce the open-sourcing of **Ovis-1.6**, our latest multi-modal large language model. Ovis is a novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
+ The Ovis family employs an innovative Multimodal Large Language Model (MLLM) architecture, specifically designed to structurally align visual and textual embeddings. Ovis1.6-Llama3.2-3B excels in common industry benchmarks, surpassing numerous open-source and proprietary multimodal models. Moreover, it is also particularly well-suited for local intelligence, on-device computing, and edge computing scenarios.
 
  <div align="center">
  <img src="https://cdn-uploads.huggingface.co/production/uploads/658a8a837959448ef5500ce5/TIlymOb86R6_Mez3bpmcB.png" width="100%" />
@@ -46,6 +47,9 @@ Below is a code snippet to run Ovis with multimodal inputs. For additional usage
  ```bash
  pip install torch==2.2.0 transformers==4.44.2 numpy==1.24.3 pillow==10.3.0
  ```
+ ```bash
+ pip install flash-attn --no-build-isolation
+ ```
  ```python
  import torch
  from PIL import Image