All HF Hub posts

lysandre
posted an update 1 day ago
Post
2392
SmolVLM-2 and SigLIP-2 are now part of transformers in dedicated releases!

They're added on top of the v4.49.0 release, and can be installed from the following tags: v4.49.0-SmolVLM-2 and v4.49.0-SigLIP-2.

This marks a new beginning for the release process of transformers. For the past five years, we've been doing monthly releases featuring many models (v4.49.0, the latest release, features 9 new architectures).

Starting with SmolVLM-2 & SigLIP-2, we'll now additionally release tags supporting new models on a stable branch. These models are therefore directly available for use by installing from the tag itself. These tags will continue to be updated with fixes applied to these models.

Going forward, continue to expect software releases that follow semantic versioning: v4.50.0 will have ~10 new architectures compared to v4.49.0, as well as a myriad of new features, improvements and bug fixes. Alongside these software releases, we'll release tags offering brand new models as fast as possible, to make them accessible to all immediately.
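
For readers who want to try one of the tags named above, here is a minimal sketch of how the install target could be built. It assumes the tags live in the public GitHub repository at `https://github.com/huggingface/transformers` and that pip's `git+URL@tag` form is used; the helper names `pip_requirement` and `install_command` are hypothetical and only build the command, they do not run it.

```python
import sys

# Assumption: the transformers source is hosted at this GitHub URL and the
# two tags quoted in the post exist there.
REPO_URL = "https://github.com/huggingface/transformers"
MODEL_TAGS = ("v4.49.0-SmolVLM-2", "v4.49.0-SigLIP-2")


def pip_requirement(tag: str) -> str:
    """Return a pip VCS requirement string pinning transformers to `tag`."""
    if tag not in MODEL_TAGS:
        raise ValueError(f"unknown tag: {tag!r}")
    return f"git+{REPO_URL}@{tag}"


def install_command(tag: str) -> list[str]:
    """Build (but do not run) the pip command for the current interpreter."""
    return [sys.executable, "-m", "pip", "install", pip_requirement(tag)]


if __name__ == "__main__":
    # Print the commands so they can be reviewed before running them by hand.
    for tag in MODEL_TAGS:
        print(" ".join(install_command(tag)))
```

Printing the command instead of executing it keeps the sketch free of side effects; passing the list to `subprocess.run` would be one way to perform the install.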
  • 1 reply
·
DmitryRyumin
posted an update 2 days ago
Post
2866
🚀🎭🌟 New Research Alert - WACV 2025 (Avatars Collection)! 🌟🎭🚀
📄 Title: EmoVOCA: Speech-Driven Emotional 3D Talking Heads 🔝

📝 Description: EmoVOCA is a data-driven method for generating emotional 3D talking heads by combining speech-driven lip movements with expressive facial dynamics. This method has been developed to overcome the limitations of corpora and to achieve state-of-the-art animation quality.

👥 Authors: @FedeNoce , Claudio Ferrari, and Stefano Berretti

📅 Conference: WACV, 28 Feb – 4 Mar, 2025 | Arizona, USA 🇺🇸

📄 Paper: https://arxiv.org/abs/2403.12886

🌐 Github Page: https://fedenoce.github.io/emovoca/
📁 Repository: https://github.com/miccunifi/EmoVOCA

🚀 CVPR-2023-24-Papers: https://github.com/DmitryRyumin/CVPR-2023-24-Papers

🚀 WACV-2024-Papers: https://github.com/DmitryRyumin/WACV-2024-Papers

🚀 ICCV-2023-Papers: https://github.com/DmitryRyumin/ICCV-2023-Papers

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #EmoVOCA #3DAnimation #TalkingHeads #SpeechDriven #FacialExpressions #MachineLearning #ComputerVision #ComputerGraphics #DeepLearning #AI #WACV2024
  • 1 reply
·
jsulz
posted an update 1 day ago
Post
2161
Time flies!

Six months after joining Hugging Face, the Xet team is kicking off the first migrations from LFS to our storage for a number of repositories on the Hub.

More on the nitty-gritty details behind the migration soon, but here are the big takeaways:

🤖 We've successfully completed the first migrations from LFS -> Xet to test the infrastructure and prepare for a wider release

✅ No action on your part needed - you can work with a Xet-backed repo like any other repo on the Hub (for now - major improvements on their way!)

👀 Keep an eye out for the Xet logo to see if a repo you know is on our infra! See the screenshots below to spot the difference 👇

⏩ ⏩ ⏩ Blazing uploads and downloads coming soon. We're gearing up for a full integration with the Hub's Python library that will make building on the Hub faster than ever - special thanks to @celinah and @Wauplin for their assistance.

🎉 Want Early Access? If you're curious and want to test out the bleeding edge that will power the development experience on the Hub, we'd love to partner with you. Let me know!

This is the culmination of a lot of effort from the entire team. Big round of applause to @sirahd @brianronan @jgodlewski @hoytak @seanses @assafvayner @znation @saba9 @rajatarya @port8080 @yuchenglow
  • 1 reply
·
JingzeShi
posted an update 1 day ago
MonsterMMORPG
posted an update 2 days ago
Post
1931
IDM-VTON: Virtual Try-On app with automatic installers for Windows, RunPod, Massed Compute and a free Kaggle account notebook published. It can transfer objects too.

Installers & APP
1-click installers for Windows, RunPod, Massed Compute and a free Kaggle account notebook are at the link below:

https://www.patreon.com/posts/122718239

Features

Installs on Windows, RunPod, Massed Compute and Kaggle with 1 click into a Python 3.10 venv

Our app has many extra features

Handles images of any resolution and aspect ratio

Manual masking through the latest version of Gradio, with a properly working image editor

Supports 4-bit and 8-bit quantization plus CPU offloading for lower-VRAM GPUs
All generated images are saved automatically

You can also generate several images (for example, 10) in one ordered batch

Official repo : https://idm-vton.github.io/
  • 2 replies
·
ychen
posted an update 1 day ago
Post
1466
Here are some annoying keywords that 4o tends to use when responding to personal experiences with negative sentiments. Will be updated over time.

rough, tough, sound like, sounds like, frustrating, overwhelming
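
One way to check a response against this list is a small phrase scanner. This is only a rough sketch: the phrases are copied from the post, while the whole-word, case-insensitive matching rule and the name `find_flagged` are assumptions made for illustration.

```python
import re

# Phrases copied from the post; the matching rules below are an assumption.
FLAGGED_PHRASES = [
    "rough", "tough", "sound like", "sounds like", "frustrating", "overwhelming",
]

# Whole-word, case-insensitive matching, so "toughest" is not counted as "tough".
_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(p) for p in FLAGGED_PHRASES) + r")\b",
    re.IGNORECASE,
)


def find_flagged(text: str) -> list[str]:
    """Return the flagged phrases found in `text`, lowercased, in order of appearance."""
    return [match.group(1).lower() for match in _PATTERN.finditer(text)]


if __name__ == "__main__":
    print(find_flagged("That sounds like a rough week."))
```

Whole-word matching keeps false positives down, but it will miss inflected forms such as "rougher"; a looser rule may suit a longer list.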
  • 2 replies
·
merve
posted an update 3 days ago
Post
4646
Google just released PaliGemma 2 Mix: new versatile instruction vision language models 🔥

> Three new models: 3B, 10B, 28B with res 224, 448 💙
> Can do vision language tasks with open-ended prompts, understand documents, and segment or detect anything 🤯

Read more https://huggingface.co/blog/paligemma2mix
Try the demo google/paligemma2-10b-mix
All models are here google/paligemma-2-mix-67ac6a251aaf3ee73679dcc4
fdaudens
posted an update about 20 hours ago
tegridydev
posted an update 1 day ago
Post
1483
Open Source AI Agents | Github/Repo List | [2025]

https://huggingface.co/blog/tegridydev/open-source-ai-agents-directory

Check out the article & Follow, bookmark, save the tab as I will be updating it <3
(using it as my own notepad & decided i might keep it up to date if i post it here, instead of making the 15th_version of it and not saving it with a name i can remember on my desktop lol)
onekq
posted an update 1 day ago
Post
1524
Still waiting for 👽Grok👽 3 API ⌛😞😫