13 9 15

Garreth Lee

garrethlee

AI & ML interests

None yet

Recent Activity

liked a Space 3 days ago

nanotron/ultrascale-playbook

upvoted an article 10 days ago

1 Billion Classifications

upvoted an article 24 days ago

KV Caching Explained: Optimizing Transformer Inference Efficiency

View all activity

Organizations

garrethlee's activity

liked a Space 3 days ago

1.34k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 month ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 14 days ago • 4.43M • • 9.98k

liked a dataset 3 months ago

HuggingFaceFW/fineweb-2

Viewer • Updated Jan 8 • 12.5B • 69.3k • 432

liked 3 Spaces 3 months ago

Number Tokenization Blog

📈

Explore how tokenization affects arithmetic in LLMs

426

Synthetic Data Generator

🧬

Build datasets using natural language

Hub LFS Analysis

📈

An analysis of LFS files on the Hub.

liked a model 3 months ago

GoToCompany/gemma2-9b-cpt-sahabatai-v1-instruct

Updated Nov 6, 2024 • 2.27k • 34

liked a Space 3 months ago

Sahabat-AI Chatbot (Gemma2 9b)

😻

Chatbot

liked 2 datasets 3 months ago

indolem/IndoMMLU

Updated Oct 11, 2023 • 12.7k • 15

PleIAs/common_corpus

Viewer • Updated 12 days ago • 470M • 47.4k • 239

liked a Space 4 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

liked 2 Spaces 5 months ago

101

TxT360: Trillion Extracted Text

📖

Create a large, deduplicated dataset for LLM pre-training

903

Model Memory Utility

🚀

Calculate memory needed to train AI models

liked a Space 6 months ago

769

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a model 11 months ago

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • Updated Sep 27, 2024 • 3.77M • • 2.66k