Adil Zouitine

AdilZtn

AI & ML interests

Reinforcement learning, Robotics, Computer vision, Scaling

Recent Activity

new activity 16 days ago
lerobot/pi0:π0-FAST
reacted to merve's post with 🚀 16 days ago
Interesting releases in open AI this week, let's recap 🤠 https://huggingface.co/collections/merve/feb-7-releases-67a5f7d7f172d8bfe0dd66f4 🤖 Robotics > Pi0, first open-source foundation vision-language action model was released in Le Robot (Apache 2.0) 💬 LLMs > Groundbreaking: s1 is simpler approach to test-time scaling, the release comes with small s1K dataset of 1k question-reasoning trace pairs (from Gemini-Thinking Exp) they fine-tune Qwen2.5-32B-Instruct to get s1-32B, outperforming o1-preview on math 🤯 s1-32B and s1K is out! > Adyen released DABstep, a new benchmark along with it's leaderboard demo for agents doing data analysis > Krutrim released Krutrim-2 instruct, new 12B model based on NeMo12B trained and aligned on Indic languages, a new multilingual sentence embedding model (based on STSB-XLM-R), and a translation model for Indic languages 👀 Multimodal > PKU released Align-DS-V, a model aligned using their new technique called LLF for all modalities (image-text-audio), along with the dataset Align Anything > OLA-7B is a new any-to-any model by Tencent that can take text, image, video, audio data with context window of 32k tokens and output text and speech in English and Chinese > Krutrim released Chitrarth, a new vision language model for Indic languages and English 🖼️ Vision > BiRefNet_HR is a new higher resolution BiRefNet for background removal 🗣️ Audio > kyutai released Hibiki, it's a real-time speech-to-speech translation model 🤯 it's available for French-English translation > Krutrim released Dhwani, a new STT model for Indic languages > They also release a new dataset for STT-TTS 🖼️ Image Generation > Lumina released Lumina-Image-2.0, a 2B parameter-flow based DiT for text to image generation > Tencent released Hunyuan3D-2, a 3D asset generation model based on DiT and Hunyuan3D-Paint > boreal-hl-v1 is a new boring photorealistic image generation LoRA based on Hunyuan
View all activity

Organizations

Hugging Face's profile picture LeRobot's profile picture

AdilZtn's activity

upvoted an article 2 days ago
view article
Article

SigLIP 2: A better multilingual vision language encoder

71
New activity in lerobot/pi0 16 days ago

π0-FAST

3
#2 opened 18 days ago by
supermodelteam
reacted to merve's post with 🚀 16 days ago
view post
Post
3039
Interesting releases in open AI this week, let's recap 🤠 merve/feb-7-releases-67a5f7d7f172d8bfe0dd66f4

🤖 Robotics
> Pi0, first open-source foundation vision-language action model was released in Le Robot (Apache 2.0)

💬 LLMs
> Groundbreaking: s1 is simpler approach to test-time scaling, the release comes with small s1K dataset of 1k question-reasoning trace pairs (from Gemini-Thinking Exp) they fine-tune Qwen2.5-32B-Instruct to get s1-32B, outperforming o1-preview on math 🤯 s1-32B and s1K is out!
> Adyen released DABstep, a new benchmark along with it's leaderboard demo for agents doing data analysis
> Krutrim released Krutrim-2 instruct, new 12B model based on NeMo12B trained and aligned on Indic languages, a new multilingual sentence embedding model (based on STSB-XLM-R), and a translation model for Indic languages

👀 Multimodal
> PKU released Align-DS-V, a model aligned using their new technique called LLF for all modalities (image-text-audio), along with the dataset Align Anything
> OLA-7B is a new any-to-any model by Tencent that can take text, image, video, audio data with context window of 32k tokens and output text and speech in English and Chinese
> Krutrim released Chitrarth, a new vision language model for Indic languages and English

🖼️ Vision
> BiRefNet_HR is a new higher resolution BiRefNet for background removal

🗣️ Audio
> kyutai released Hibiki, it's a real-time speech-to-speech translation model 🤯 it's available for French-English translation
> Krutrim released Dhwani, a new STT model for Indic languages
> They also release a new dataset for STT-TTS

🖼️ Image Generation
> Lumina released Lumina-Image-2.0, a 2B parameter-flow based DiT for text to image generation
> Tencent released Hunyuan3D-2, a 3D asset generation model based on DiT and Hunyuan3D-Paint
> boreal-hl-v1 is a new boring photorealistic image generation LoRA based on Hunyuan
upvoted an article 19 days ago
view article
Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

106
reacted to fdaudens's post with ❤️ 6 months ago
view post
Post
1979
📫 AI in the news today: Struggling AI startups, Figure 0 robot, chipmakers

- OpenAI Co-Founders Schulman and Brockman Step Back
https://finance.yahoo.com/news/openai-co-founders-schulman-brockman-010542796.html

- Struggling AI Startups Look for a Bailout from Big Tech
"More exits—either pseudo-acquisitions or real ones—are coming, investors say, as a bubble built by the excitement around generative AI is showing signs of peaking."
https://www.wsj.com/tech/ai/struggling-ai-startups-look-for-a-bailout-from-big-tech-3e635927?mod=rss_Technology

- Did Google Just Pay $2.5 Billion to Hire Character's CEO?
https://www.theinformation.com/articles/did-google-just-pay-2-5-billion-to-hire-characters-ceo

- Figure’s new humanoid robot leverages OpenAI for natural speech conversations
Figure has unveiled its latest humanoid robot, the Figure 02.
The most notable addition this time out arrives by way a longstanding partnership with OpenAI, which helped Figure raise a $675 million Series B back in February, valuing the South Bay firm at $2.6 billion.
https://techcrunch.com/2024/08/06/figures-new-humanoid-robot-leverages-openai-for-natural-speech-conversations/

- World’s Five Leading Chipmakers Have Now Promised U.S. Investment
The Biden administration award up to $450 million in grants to a South Korean chipmaker, SK Hynix, to help build its new chip facility in Indiana
The US now has commitments from all five of the world’s leading-edge semiconductor manufacturers to construct chip plants in theUS with financial assistance from the administration
https://www.nytimes.com/2024/08/06/business/economy/chipmakers-promise-investment.html