Hynek Kydlicek PRO

hynky

AI & ML interests

Data-processing

Recent Activity

liked a Space 4 days ago

nanotron/ultrascale-playbook

new activity 5 days ago

nanotron/ultrascale-playbook:fix optims

updated a Space 5 days ago

nanotron/ultrascale-playbook

hynky's activity

liked a Space 4 days ago

The Ultra-Scale Playbook

The ultimate guide to training LLMs on large GPU clusters

liked a dataset 2 months ago

data-is-better-together/fineweb-c

Viewer • Updated 12 days ago • 62.1k • 3.3k • 39

liked a dataset 3 months ago

HuggingFaceFW/fineweb-2

Viewer • Updated Jan 8 • 12.5B • 69.3k • 432

liked a Space 3 months ago

Number Tokenization Blog

Explore how tokenization affects arithmetic in LLMs

liked a dataset 3 months ago

CohereForAI/Global-MMLU

Viewer • Updated Dec 12, 2024 • 602k • 13.2k • 106

liked a Space 3 months ago

Discussion Forum

liked a dataset 4 months ago

ClusterlabAi/InstAr-500k

Viewer • Updated Jul 30, 2024 • 481k • 161 • 10

liked a Space 4 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

Evaluate multilingual models using FineTasks

liked a dataset 4 months ago

LLM360/TxT360

Preview • Updated Nov 8, 2024 • 569k • 223

liked 2 Spaces 5 months ago

Hub LFS Analysis

An analysis of LFS files on the Hub.

TxT360: Trillion Extracted Text

Create a large, deduplicated dataset for LLM pre-training

liked a dataset 6 months ago

Cleanlab/bad_data_gsm8k_svamp.csv

Viewer • Updated Apr 25, 2024 • 34 • 61 • 3

liked a Space 6 months ago

Datasets Metrics Explorer

liked 3 datasets 7 months ago

ThaiSyntheticQA/ThaiQA-v1

Viewer • Updated Jul 24, 2024 • 12.7k • 79 • 4

coastalcph/fairlex

Updated Jul 27, 2023 • 176 • 7

meta-llama/Llama-3.1-405B-Instruct-evals

Viewer • Updated Oct 2, 2024 • 158k • 162 • 21

liked 3 datasets 8 months ago

jon-tow/okapi_mmlu

Updated Oct 24, 2023 • 396 • 5

pakphum/winograd_th

Viewer • Updated Nov 16, 2024 • 285 • 71 • 4

scb10x/thai_exam

Viewer • Updated Jul 8, 2024 • 590 • 190 • 11

liked a Space 8 months ago

Open-LLM performances are plateauing, let’s make the leaderboard steep again

Update leaderboard for fair model evaluation