Sourab Mangrulkar

smangrul

AI & ML interests

Machine Learning, Deep Learning, Natural Language Processing, Natural Language Generation, Computer Vision, Reinforcement Learning

Recent Activity

liked a model about 2 months ago
mistralai/Mistral-7B-Instruct-v0.3
liked a model about 2 months ago
mistralai/Mamba-Codestral-7B-v0.1
liked a model 3 months ago
meta-llama/Llama-3.2-11B-Vision-Instruct
View all activity

Organizations

Speech Recognition Community Event Version 2's profile picture BigScience Data's profile picture group2's profile picture BigCode's profile picture Diffusers Pipelines Library for Stable Diffusion's profile picture Social Post Explorers's profile picture

smangrul's activity

published an article 11 months ago
view article
Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

26
published an article about 1 year ago
view article
Article

🤗 PEFT welcomes new merging methods

16
published an article about 1 year ago
view article
Article

Mixture of Experts Explained

391
published an article over 1 year ago
view article
Article

Personal Copilot: Train Your Own Coding Assistant

44
published an article over 1 year ago
view article
Article

Fine-tuning Llama 2 70B using PyTorch FSDP

19
published an article over 1 year ago
view article
Article

The Falcon has landed in the Hugging Face ecosystem

12
published an article over 1 year ago
view article
Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

119
published an article almost 2 years ago
view article
Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

42
published an article about 2 years ago
view article
Article

🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware

57
published an article over 2 years ago
view article
Article

Accelerate Large Model Training using DeepSpeed

3
published an article almost 3 years ago
view article
Article

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

3