Paper: AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications • 2508.16279 • Published 16 days ago
Article: Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel • By smangrul and 1 other • May 2, 2022
Paper: AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs • 2508.16153 • Published 16 days ago
Paper: LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models • 2403.13372 • Published Mar 20, 2024
Collection: Gemma 3 QAT • Quantization-Aware Trained (QAT) Gemma 3 checkpoints that preserve quality similar to half precision while using 3x less memory • 15 items • Updated Jul 10