35 7 75

enzo PRO

enzostvs

https://en-zo.dev

AI & ML interests

here to make beautiful things

Recent Activity

liked a Space 5 days ago

Trudy/gemini-realtime-dots

reacted to merve's post with 👍 9 days ago

Your weekly recap of open AI is here, and it's packed with models! https://huggingface.co/collections/merve/feb-14-releases-67af876b404cc27c6d837767 👀 Multimodal > OpenGVLab released InternVideo 2.5 Chat models, new video LMs with long context > AIDC released Ovis2 model family along with Ovis dataset, new vision LMs in different sizes (1B, 2B, 4B, 8B, 16B, 34B), with video and OCR support > ColQwenStella-2b is a multilingual visual retrieval model that is sota in it's size > Hoags-2B-Exp is a new multilingual vision LM with contextual reasoning, long context video understanding 💬 LLMs A lot of math models! > Open-R1 team released OpenR1-Math-220k large scale math reasoning dataset, along with Qwen2.5-220K-Math fine-tuned on the dataset, OpenR1-Qwen-7B > Nomic AI released new Nomic Embed multilingual retrieval model, a MoE with 500 params with 305M active params, outperforming other models > DeepScaleR-1.5B-Preview is a new DeepSeek-R1-Distill fine-tune using distributed RL on math > LIMO is a new fine-tune of Qwen2.5-32B-Instruct on Math 🗣️ Audio > Zonos-v0.1 is a new family of speech recognition models, which contains the model itself and embeddings 🖼️ Vision and Image Generation > We have ported DepthPro of Apple to transformers for your convenience! > illustrious-xl-v1.0 is a new illustration generation model

liked a Space 9 days ago

le-leadboard/OpenLLMFrenchLeaderboard

View all activity

Organizations

enzostvs's activity

liked a Space 5 days ago

Gemini Live API Demo - 3 Dots

🟢

Drag and move colorful circles on a canvas

reacted to merve's post with 👍 9 days ago

Post

4593

Your weekly recap of open AI is here, and it's packed with models! merve/feb-14-releases-67af876b404cc27c6d837767

👀 Multimodal
> OpenGVLab released InternVideo 2.5 Chat models, new video LMs with long context
> AIDC released Ovis2 model family along with Ovis dataset, new vision LMs in different sizes (1B, 2B, 4B, 8B, 16B, 34B), with video and OCR support
> ColQwenStella-2b is a multilingual visual retrieval model that is sota in it's size
> Hoags-2B-Exp is a new multilingual vision LM with contextual reasoning, long context video understanding

💬 LLMs
A lot of math models!
> Open-R1 team released OpenR1-Math-220k large scale math reasoning dataset, along with Qwen2.5-220K-Math fine-tuned on the dataset, OpenR1-Qwen-7B
> Nomic AI released new Nomic Embed multilingual retrieval model, a MoE with 500 params with 305M active params, outperforming other models
> DeepScaleR-1.5B-Preview is a new DeepSeek-R1-Distill fine-tune using distributed RL on math
> LIMO is a new fine-tune of Qwen2.5-32B-Instruct on Math

🗣️ Audio
> Zonos-v0.1 is a new family of speech recognition models, which contains the model itself and embeddings

🖼️ Vision and Image Generation
> We have ported DepthPro of Apple to transformers for your convenience!
> illustrious-xl-v1.0 is a new illustration generation model

3 replies

liked a Space 9 days ago

OpenLLM French leaderboard 🇫🇷

🥇

Explore and compare LLM benchmarks and submit models for evaluation

updated a Space 11 days ago

Hugger Lover

😍

Give some love to a Hugging Face profile.

liked a model 11 days ago

meta-llama/Llama-3.1-70B-Instruct

Text Generation • Updated Dec 15, 2024 • 365k • • 790

updated a Space 12 days ago

— Hub API Playground —

🕹

Try the Hugging Face API through the playground

New activity in enzostvs/hub-api-playground 12 days ago

Update utils/datas/api_collections.ts

#5 opened 12 days ago by

frascuchon

liked a Space 19 days ago

1.03k

IC Light

📈

Generate relit images from your photo

liked a Space 20 days ago

335

Magic Face

🤪

Transform Your Face Into Legendary Characters!

New activity in enzostvs/zero-gpu-spaces 22 days ago

Upload الرسالة كاملة بي دي اف pdf.pdf

#4 opened 22 days ago by

Mansouralfaifi

reacted to merve's post with 👍 22 days ago

Post

3837

This week in open AI was 🔥 Let's recap! 🤗 merve/january-31-releases-679a10669bd4030090c5de4d
LLMs 💬
> Huge: AllenAI released new Tülu models that outperform DeepSeek R1 using Reinforcement Learning with Verifiable Reward (RLVR) based on Llama 3.1 405B 🔥
> Mistral AI is back to open-source with their "small" 24B models (base & SFT), with Apache 2.0 license 😱
> Alibaba Qwen released their 1M context length models Qwen2.5-Instruct-1M, great for agentic use with Apache 2.0 license 🔥
> Arcee AI released Virtuoso-medium, 32.8B LLMs distilled from DeepSeek V3 with dataset of 5B+ tokens
> Velvet-14B is a new family of 14B Italian LLMs trained on 10T tokens in six languages
> OpenThinker-7B is fine-tuned version of Qwen2.5-7B-Instruct on OpenThoughts dataset

VLMs & vision 👀
> Alibaba Qwen is back with Qwen2.5VL, amazing new capabilities ranging from agentic computer use to zero-shot localization 🔥
> NVIDIA released new series of Eagle2 models with 1B and 9B sizes
> DeepSeek released Janus-Pro, new any-to-any model (image-text generation from image-text input) with MIT license
> BEN2 is a new background removal model with MIT license!

Audio 🗣️
> YuE is a new open-source music generation foundation model, lyrics-to-song generation

Codebase 👩🏻‍💻
> We are open-sourcing our SmolVLM training and eval codebase! https://github.com/huggingface/smollm/tree/main/vision
> Open-R1 is open-source reproduction of R1 by @huggingface science team https://huggingface.co/blog/open-r1

1 reply

reacted to fdaudens's post with 🔥 23 days ago

Post

3340

🎯 Kokoro TTS just hit v1.0! 🚀

Small but mighty: 82M parameters, runs locally, speaks multiple languages. The best part? It's Apache 2.0 licensed!
This could unlock so many possibilities ✨

Check it out: hexgrad/Kokoro-82M