Ali El Filali

alielfilali01

AI & ML interests

AI Psychometrician ? | NLP (mainly for Arabic) | Interests include Reinforcement Learning and Cognitive sciences among others

Recent Activity

upvoted an article 1 day ago

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

upvoted a collection 1 day ago

IndicGenBench

upvoted a collection 1 day ago

PaliGemma 2 Mix

View all activity

Organizations

alielfilali01's activity

upvoted an article 1 day ago

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

•

Jul 27, 2024

• 31

upvoted 3 collections 1 day ago

liked a model 1 day ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 8 days ago • 1.58M • 531

reacted to MohamedRashad's post with 🚀❤️ 2 days ago

Post

1681

A while back i shared this model MohamedRashad/arabic-small-nougat that was a finetune from facebook/nougat-small for the Arabic Language.

Today this humble project has been scaled with new models, new datasets, new space, and a new paper

Check everything throught this collection here:
MohamedRashad/arabic-nougat-673a3f540bd92904c9b92a8e

1 reply

liked a dataset 2 days ago

MBZUAI/TimeTravel

Viewer • Updated 2 days ago • 10.3k • 118 • 7

updated a model 2 days ago

inceptionai/jais-adapted-70b-chat-4bit-bnb

Text Generation • Updated 2 days ago

published a model 2 days ago

inceptionai/jais-adapted-70b-chat-4bit-bnb

Text Generation • Updated 2 days ago

liked a dataset 3 days ago

Qwen/P-MMEval

Viewer • Updated Nov 28, 2024 • 19.7k • 1.65k • 9

liked a model 3 days ago

Qwen/Qwen2.5-7B-Instruct-1M

Text Generation • Updated 25 days ago • 293k • 238

posted an update 3 days ago

Post

539

🚨 Arabic LLM Evaluation 🚨

Few models join the ranking of inceptionai/AraGen-Leaderboard Today.

The new MistralAI model, Saba, is quite impressive, Top10 ! Well done @arthurmensch and team.

Sadly Mistral did not follow its strategy about public weights this time, we hope this changes soon and we get the model with a permissive license.

We added other Mistral models and apparently, we have been sleeping on mistralai/Mistral-Large-Instruct-2411 !

Another impressive model that joined the ranking today is ALLaM-AI/ALLaM-7B-Instruct-preview. After a long wait finally ALLaM is here and it is IMPRESSIVE given its size !

ALLaM is ranked on OALL/Open-Arabic-LLM-Leaderboard as well.

updated a Space 3 days ago

AraGen Leaderboard

📊

Generative Tasks Evaluation of Arabic LLMs

liked a model 3 days ago

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 8 days ago • 213k • 315

commented on PaliGemma 2 Mix - New Instruction Vision Language Models by Google 3 days ago

Hey there, is from transformers import AutoProcessor, AutoModelForVision2Seq can be useed for all VLMs or do we have to go with from transformers import PaliGemmaProcessor, PaliGemmaForConditionalGeneration for PaliGemma and from transformers import Qwen2_5_VLForConditionalGeneration, AutoTokenizer, AutoProcessor for QwenVL ? My goal is to have a unified script for all VLMs on the Hub given a model_name as arg

reacted to merve's post with 🚀🧠 3 days ago

Post

4744

Google just released PaliGemma 2 Mix: new versatile instruction vision language models 🔥

> Three new models: 3B, 10B, 28B with res 224, 448 💙
> Can do vision language tasks with open-ended prompts, understand documents, and segment or detect anything 🤯

Read more https://huggingface.co/blog/paligemma2mix
Try the demo google/paligemma2-10b-mix
All models are here google/paligemma-2-mix-67ac6a251aaf3ee73679dcc4

reacted to dreamerdeo's post with 🤗🚀 3 days ago

Post

2697

🚀 Excited to share our technical report on the Southeast Asian multilingual model Sailor2 and its latest updates!

Our 49-page report details Sailor2's development journey, including multilingual data cleaning, small model data mixture simulations, multi-stage continual pre-training, multi-stage post-training, and multi-cultural multi-lingual evaluations. Sailor2 aims to streamline the multilingual model pre-training process efficiently for the community.

🧭 We highlight Sailor2's impressive performance in low-resource language translation scenarios and its cultural understanding advantages in Southeast Asia, promoting practical applications for regional languages.

Model updates include:
💡 More precise outputs: Reduced redundancy in model outputs through refined post-training data and optimization techniques.
🌈 Handling longer texts: Expanded to handle up to 128K context length in Southeast Asian languages through long-text training.
⚡️ Faster inference: Achieved 2.5x faster inference speed with speculative decoding.
🌪️ More model sizes: Introduced new sizes of 3B and 14B through model pruning.

🌟 All models are Apache-licensed for commercial use; development tools (code, resources) are open-source.

📚 Technical report: Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs (2502.12982)
🤖️ Models: sail/sailor2-language-models-674d7c9e6b4dbbd9a869906b
💬 Demo: sail/Sailor2-20B-Chat
📣 Sailor2 community: https://huggingface.co/sailor2