Rui Malheiro PRO

rmpmalheiro

ruimalheiro

AI & ML interests

None yet

Recent Activity

liked a Space 5 days ago

nanotron/ultrascale-playbook

liked a dataset 10 days ago

xiaowu0162/longmemeval

liked a dataset 15 days ago

allenai/tulu-3-sft-olmo-2-mixture

rmpmalheiro's activity

liked a Space 5 days ago

1.41k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLMs on large GPU clusters

liked a dataset 10 days ago

xiaowu0162/longmemeval

Updated Nov 7, 2024 • 86 • 3

liked 4 datasets 15 days ago

liked a Space 15 days ago

775

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a dataset 25 days ago

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 12.1k • 300

liked a model 25 days ago

HuggingFaceTB/SmolLM-135M

Text Generation • Updated Aug 1, 2024 • 164k • 195

liked a dataset 25 days ago

open-thoughts/OpenThoughts-114k

Viewer • Updated 4 days ago • 228k • 103k • 594

liked 2 models about 1 month ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated about 4 hours ago • 4.43M • 10.1k

unsloth/phi-4-GGUF

Text Generation • Updated Jan 13 • 36.3k • 150

liked a dataset about 1 month ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 14k • 1.27k

liked 2 models about 1 month ago

sentence-transformers/static-similarity-mrl-multilingual-v1

sentence-transformers/static-retrieval-mrl-en-v1

liked a Space 2 months ago

519

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

liked a dataset 4 months ago

allenai/dolma

Updated Apr 17, 2024 • 1.65k • 876

liked 3 models 4 months ago

amd/AMD-OLMo

Text Generation • Updated Nov 3, 2024 • 75

HuggingFaceTB/SmolLM2-135M

Text Generation • Updated 18 days ago • 224k • 62

HuggingFaceTB/SmolLM2-135M-Instruct

Text Generation • Updated 18 days ago • 160k • 139