Long Context - 16k, 32k, 64k, 128k, 200k, 256k, 512k, 1000k Collection Q6/Q8 models here. Mixtrals/Mistral (and merges) generally have 32k context (not listed here). Please see the org model card for usage / templates. • 69 items • Updated 7 days ago • 10
Open-source speech datasets annotated using Data-Speech Collection Open-source annotated speech datasets ranging from 1,000 hours to 45,000 hours. • 11 items • Updated Aug 8, 2024 • 5
DeepSeek-R1-ReDistill Collection Re-distilled DeepSeek R1 models • 4 items • Updated 25 days ago • 14
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 13 days ago • 88
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state-of-the-art open post-training recipes. • 33 items • Updated 13 days ago • 70
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 357
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Dec 13, 2024 • 330
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Dec 13, 2024 • 145