science-of-finetuning/gemma-2-2b-crosscoder-l13-mu4.1e-02-lr1e-04 Feature Extraction • Updated Nov 22, 2024 • 578 • 2
FlofloB/100k_fineweb_continued_pretraining_Qwen2.5-0.5B-Instruct_Unsloth_merged_16bit Text Generation • Updated Jan 21 • 110 • 1