InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU • Paper • 2502.08910 • Published 11 days ago • 139 upvotes
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling • Paper • 2502.06703 • Published 13 days ago • 134 upvotes
TransMLA: Multi-head Latent Attention Is All You Need • Paper • 2502.07864 • Published 12 days ago • 43 upvotes
SmolVLM 256M & 500M • Collection • Models & demos for the even smoller SmolVLM release • 12 items • Updated 3 days ago • 69 upvotes