Haolei

holyhigh666

AI & ML interests

None yet

Recent Activity

upvoted an article about 24 hours ago

Small Language Models (SLMs): A Comprehensive Overview

updated a Space 1 day ago

holyhigh666/Paddle-OCR

reacted to lysandre's post with ❤️ 2 days ago

SmolVLM-2 and SigLIP-2 are now part of `transformers` in dedicated releases! They're added on top of the v4.49.0 release, and can be installed from the following tags: `v4.49.0-SmolVLM-2` and `v4.49.0-SigLIP-2`.

This marks a new beginning for the release process of transformers. For the past five years, we've been doing monthly releases featuring many models (v4.49.0, the latest release, features 9 new architectures). Starting with SmolVLM-2 and SigLIP-2, we'll now additionally release tags supporting new models on a stable branch. These models are therefore directly available for use by installing from the tag itself. These tags will continue to be updated with fixes applied to these models.

Going forward, continue to expect software releases following semantic versioning: v4.50.0 will have ~10 new architectures compared to v4.49.0, as well as a myriad of new features, improvements and bug fixes. Accompanying these software releases, we'll release tags offering brand new models as fast as possible, to make them accessible to all immediately.
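As a rough illustration of what "installing from the tag itself" could look like, here is a minimal sketch that builds a pip command for one of the tags named in the post. The repository URL and the helper name `pip_install_args` are assumptions made for this example, not something the post states; the sketch only prints the command rather than running it.

```python
import sys

# Assumption: the tags are published in the main transformers repository on GitHub.
REPO_URL = "https://github.com/huggingface/transformers"


def pip_install_args(tag: str) -> list[str]:
    """Build the argument list for installing transformers from a git tag.

    Hypothetical helper for illustration; it relies on pip's support for
    installing from a VCS URL of the form git+<repo>@<ref>.
    """
    return [sys.executable, "-m", "pip", "install", f"git+{REPO_URL}@{tag}"]


if __name__ == "__main__":
    # Tag names are taken from the post; print the command instead of executing it.
    for tag in ("v4.49.0-SmolVLM-2", "v4.49.0-SigLIP-2"):
        print(" ".join(pip_install_args(tag)))
```

To actually perform the install, the returned list could be passed to `subprocess.run(..., check=True)`. Since the post says the tags keep receiving fixes, reinstalling from the same tag later may pull in a newer commit.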

Organizations

None yet

holyhigh666's activity

liked a Space 3 days ago

VLM R1 Referral Expression

💬

Highlight described objects in images

liked a Space 9 days ago

Ovis2 16B

🦫

See, read, and reason—better together.

liked a Space 16 days ago

196

BiRefNet Demo

🐠

Extract and mask subjects from images

liked a Space 20 days ago

306

TTS Spaces Arena

🤗

Blind vote on HF TTS models!

liked a model 22 days ago

m-a-p/YuE-s1-7B-anneal-en-cot

Text Generation • Updated 24 days ago • 112k • 376

liked 5 Spaces 22 days ago

Open SUNO

👩

Your Lyrics into Complete Songs with Vocals in Multilingual

1.83k

FacePoke

🙂

Import a portrait, click to move the head!

3.07k

Live Portrait

🤪

Apply the motion of a video on a portrait

1.81k

Chat With Janus-Pro-7B

🌍

A unified multimodal understanding and generation model.

Qwen2.5 VL 72B Instruct

💻

Interact with Qwen2.5-VL-72B to get responses and generate images

liked a Space 24 days ago

YOLOv10 Document Layout Analysis

🏆

Analyze scanned documents to detect and label content

liked a Space 25 days ago

220

Seed Voice Conversion

🎤

Convert voice to match another using reference audio

liked a Space 26 days ago

Qwen2.5-1M Demo

💻

Upload documents to answer questions

liked a Space 27 days ago

Janus Pro 7b

🌍

A unified multimodal understanding and generation model.

liked a model about 1 month ago

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • Updated Dec 2, 2024 • 114k • 391

liked 2 Spaces about 1 month ago

275

Llasa 3b Tts

🔥

Zero Shot voice cloning with llasa 3b (Unofficial Demo)

338

LatentSync

👄

Audio Conditioned LipSync with Latent Diffusion Models

liked 3 Spaces about 2 months ago

1.79k

Voice Clone

🗣

Clone voice to say text

2.39k

XTTS

🐸

1.35k

Background Removal

🌘

Remove backgrounds from images