
archit

archit11

AI & ML interests

Small language models. Looking for work; please reach out: [email protected]

Recent Activity

liked a model 6 days ago
NovaSearch/stella_en_400M_v5
liked a dataset 13 days ago
open-thoughts/OpenThoughts-114k
liked a dataset 18 days ago
simplescaling/s1K

Organizations

Literally Me FRFR Research Society, Blog-explorers, ZeroGPU Explorers, IndiaBuild, Hugging Face Discord Community

archit11's activity

upvoted an article 18 days ago
Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

By Pclanglais • 29
New activity in ubermenchh/SmolLM2-DPO 22 days ago

details pls

1
#1 opened 22 days ago by archit11
upvoted an article 23 days ago
Article

How to deploy and fine-tune DeepSeek models on AWS

• 46
upvoted an article 25 days ago
Article

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

By davanstrien • 8
upvoted an article about 1 month ago
Article

Train 400x faster Static Embedding Models with Sentence Transformers

• 148