First commit

Files changed (8)

REAME.md +95 -0
diff2lip/Diff2Lip.pth +3 -0
fomm/vox-cpk.pth +3 -0
gfpgan/GFPGANv1.3.pth +3 -0
sadtalker/SadTalker_V0.0.2_256.safetensors +3 -0
sadtalker/epoch_20.pth +3 -0
sadtalker/sadtalker.pth +3 -0
wav2lip/wav2lip_gan.pth +3 -0

REAME.md ADDED

	@@ -0,0 +1,95 @@

+# Avatar‑Renderer Checkpoints
+This repository bundles all pretrained model checkpoints required by the [Avatar Renderer MCP](https://github.com/ruslanmv/avatar-renderer-mcp) pipeline.
+**VideoGenie Avatar Generator** is a single‑image → talking‑head engine that ships an MCP‑native stdio server (`render_avatar` tool) and a FastAPI REST façade in one CUDA container. Drop it into any GPU pool and your MCP Gateway auto‑discovers it on boot.
+This model‑hub repo allows you to fetch **all** necessary checkpoints from a **single source** via Git LFS or the Hugging Face Hub API.
+---
+## Directory structure
+```
+├── diff2lip
+│   └── Diff2Lip.pth                # Audio‑to‑lip Diffusion model
+├── fomm
+│   └── vox-cpk.pth                 # First‑Order‑Motion vox‑cpk checkpoint
+├── gfpgan
+│   └── GFPGANv1.3.pth              # GFPGAN v1.3 face enhancement model
+├── sadtalker
+│   ├── SadTalker_V0.0.2_256.safetensors  # Safetensors release bundle
+│   ├── epoch_20.pth               # Training checkpoint (epoch 20)
+│   └── sadtalker.pth              # Legacy binary checkpoint
+└── wav2lip
+    └── wav2lip_gan.pth            # Wav2Lip GAN audio-to-lip model
+```
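+Each subfolder holds the checkpoint files for one model. `sadtalker/` ships several formats for compatibility with different inference pipelines; note that `sadtalker/epoch_20.pth` and `sadtalker/sadtalker.pth` are byte-identical (same Git LFS SHA-256 and size).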
+---
+## Usage
+### 1. Clone via Git LFS
+```bash
+# Ensure Git LFS is installed:
+#   https://git-lfs.github.com/
+git clone https://huggingface.co/ruslanmv/avatar-renderer
+cd avatar-renderer
+# The checkpoint folders now sit at the repository root, matching the structure above.
+```
+### 2. Download via Python (Hugging Face Hub API)
+```python
+from huggingface_hub import snapshot_download
+# Download all files into the ./models-cache cache; the returned path is the snapshot folder
+models_dir = snapshot_download(
+    repo_id="ruslanmv/avatar-renderer",
+    cache_dir="./models-cache",
+)
+print("Checkpoints downloaded to:", models_dir)
+```
+### 3. Integrate with Avatar Renderer MCP
+In your **Avatar Renderer MCP** project, set the checkpoint environment variables to point at your local copy of this repository:
+```bash
+export FOMM_CKPT_DIR=/path/to/avatar-renderer/fomm
+export DIFF2LIP_CKPT=/path/to/avatar-renderer/diff2lip/Diff2Lip.pth
+export SADTALKER_CKPT_DIR=/path/to/avatar-renderer/sadtalker
+export WAV2LIP_CKPT=/path/to/avatar-renderer/wav2lip/wav2lip_gan.pth
+export GFPGAN_CKPT=/path/to/avatar-renderer/gfpgan/GFPGANv1.3.pth
+```
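+Alternatively, copy the checkpoints into `/models` when building a Docker image. Note that `COPY --from=` expects a container image or build stage with that name, not a Hugging Face repository: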
+```dockerfile
+FROM ruslanmv/avatar-renderer-mcp:latest
+COPY --from=ruslanmv/avatar-renderer /models /models
+CMD ["uvicorn", "app.api:app", "--host", "0.0.0.0", "--port", "8000"]
+```
+---
+## License
+This repository collects checkpoints that were released under their respective open licenses:
+* **FOMM**: [Apache‑2.0](https://github.com/AliaksandrSiarohin/first-order-model/blob/master/LICENSE)
+* **Diff2Lip**: [MIT](https://github.com/YuanGary/DiffusionLi/blob/main/LICENSE)
+* **SadTalker**: [Apache‑2.0](https://github.com/Winfredy/SadTalker/blob/main/LICENSE)
+* **Wav2Lip**: [MIT](https://github.com/Rudrabha/Wav2Lip/blob/master/LICENSE)
+* **GFPGAN**: [MIT](https://github.com/TencentARC/GFPGAN/blob/main/LICENSE)
+Please refer to each upstream project for full license details.
+---
+> Maintained by [ruslanmv](https://github.com/ruslanmv).
+> Part of the [Avatar Renderer MCP](https://github.com/ruslanmv/avatar-renderer-mcp) ecosystem.
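
The five environment variables listed in section 3 of the README all hang off one checkout root, so they can be derived from a single path. The following is a minimal sketch under that assumption; `checkpoint_env` and `missing_paths` are hypothetical helper names, not part of Avatar Renderer MCP, and only the variable names and relative paths come from the README.

```python
from pathlib import Path


def checkpoint_env(repo_root):
    """Build the README's environment variables from one checkout root."""
    root = Path(repo_root)
    return {
        "FOMM_CKPT_DIR": str(root / "fomm"),
        "DIFF2LIP_CKPT": str(root / "diff2lip" / "Diff2Lip.pth"),
        "SADTALKER_CKPT_DIR": str(root / "sadtalker"),
        "WAV2LIP_CKPT": str(root / "wav2lip" / "wav2lip_gan.pth"),
        "GFPGAN_CKPT": str(root / "gfpgan" / "GFPGANv1.3.pth"),
    }


def missing_paths(env):
    """Return the names of variables whose target does not exist on disk."""
    return sorted(name for name, path in env.items() if not Path(path).exists())


# Placeholder root, as in the README; replace with the real checkout location.
env = checkpoint_env("/path/to/avatar-renderer")
for name, value in env.items():
    print(f"export {name}={value}")
```

Calling `missing_paths(env)` before starting the server gives an early, readable failure if a clone skipped the LFS download or a path is mistyped.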

diff2lip/Diff2Lip.pth ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8c71166482d2b893f2f77450563a1bb31d805f3048c7213b974fd9201e9aa4b3
+size 406815527
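
Each `.pth` and `.safetensors` entry in this commit is a Git LFS pointer: a three-line text file giving the spec version, the SHA-256 object ID, and the size in bytes. A minimal sketch of reading one is shown here; `LfsPointer` and `parse_lfs_pointer` are hypothetical names, and stripping the leading `+` is only needed because the text is copied from a diff.

```python
from typing import NamedTuple


class LfsPointer(NamedTuple):
    version: str
    algo: str
    digest: str
    size: int


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.splitlines():
        line = line.lstrip("+").strip()  # tolerate diff "+" prefixes
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return LfsPointer(fields["version"], algo, digest, int(fields["size"]))


pointer = parse_lfs_pointer(
    "+version https://git-lfs.github.com/spec/v1\n"
    "+oid sha256:8c71166482d2b893f2f77450563a1bb31d805f3048c7213b974fd9201e9aa4b3\n"
    "+size 406815527\n"
)
print(pointer.algo, pointer.size)
```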

fomm/vox-cpk.pth ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:abb41ab1f279f26326c0d6e4d20702de6658364665aa1313daa7a63e89ea2b23
+size 728766691

gfpgan/GFPGANv1.3.pth ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c953a88f2727c85c3d9ae72e2bd4846bbaf59fe6972ad94130e23e7017524a70
+size 348632874

sadtalker/SadTalker_V0.0.2_256.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c211f5d6de003516bf1bbda9f47049a4c9c99133b1ab565c6961e5af16477bff
+size 725066984

sadtalker/epoch_20.pth ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6d17a6b23457b521801baae583cb6a58f7238fe6721fc3d65d76407460e9149b
+size 288860037

sadtalker/sadtalker.pth ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6d17a6b23457b521801baae583cb6a58f7238fe6721fc3d65d76407460e9149b
+size 288860037
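
The pointers for `sadtalker/epoch_20.pth` and `sadtalker/sadtalker.pth` carry the same object ID and size, so both paths resolve to the same bytes. A small sketch for spotting such duplicates by grouping paths on their OID follows; the mapping is transcribed from the pointers in this commit, and `find_duplicates` is a hypothetical helper name.

```python
from collections import defaultdict

# Path -> SHA-256 object ID, copied from the LFS pointers in this commit.
POINTERS = {
    "diff2lip/Diff2Lip.pth": "8c71166482d2b893f2f77450563a1bb31d805f3048c7213b974fd9201e9aa4b3",
    "fomm/vox-cpk.pth": "abb41ab1f279f26326c0d6e4d20702de6658364665aa1313daa7a63e89ea2b23",
    "gfpgan/GFPGANv1.3.pth": "c953a88f2727c85c3d9ae72e2bd4846bbaf59fe6972ad94130e23e7017524a70",
    "sadtalker/SadTalker_V0.0.2_256.safetensors": "c211f5d6de003516bf1bbda9f47049a4c9c99133b1ab565c6961e5af16477bff",
    "sadtalker/epoch_20.pth": "6d17a6b23457b521801baae583cb6a58f7238fe6721fc3d65d76407460e9149b",
    "sadtalker/sadtalker.pth": "6d17a6b23457b521801baae583cb6a58f7238fe6721fc3d65d76407460e9149b",
    "wav2lip/wav2lip_gan.pth": "ca9ab7b7b812c0e80a6e70a5977c545a1e8a365a6c49d5e533023c034d7ac3d8",
}


def find_duplicates(pointers):
    """Group paths by object ID and return groups with more than one path."""
    by_oid = defaultdict(list)
    for path, oid in pointers.items():
        by_oid[oid].append(path)
    return [sorted(paths) for paths in by_oid.values() if len(paths) > 1]


duplicates = find_duplicates(POINTERS)
print(duplicates)
```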

wav2lip/wav2lip_gan.pth ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ca9ab7b7b812c0e80a6e70a5977c545a1e8a365a6c49d5e533023c034d7ac3d8
+size 435801865
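
Because every pointer records a SHA-256 digest and a byte size, a downloaded checkpoint can be checked against its pointer before it is loaded. The sketch below is one way to do that with the standard library; `matches_pointer` is a hypothetical helper name and the local path in the usage lines is an assumption about where the file was downloaded.

```python
import hashlib
from pathlib import Path


def matches_pointer(path, expected_sha256, expected_size, chunk_size=1 << 20):
    """Return True if the file's size and SHA-256 match the LFS pointer."""
    path = Path(path)
    if path.stat().st_size != expected_size:
        return False  # cheap check first; avoids hashing a truncated file
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Stream in chunks so multi-hundred-megabyte files are not read at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha256


# Usage with the Wav2Lip pointer values from this commit (path is an assumption).
ckpt = Path("avatar-renderer/wav2lip/wav2lip_gan.pth")
if ckpt.exists():
    ok = matches_pointer(
        ckpt,
        "ca9ab7b7b812c0e80a6e70a5977c545a1e8a365a6c49d5e533023c034d7ac3d8",
        435801865,
    )
    print("wav2lip_gan.pth verified:", ok)
```

A size mismatch of a few hundred bytes usually means the file on disk is still the LFS pointer itself rather than the downloaded object.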