I found that if we apply the reasoning system prompt published on the NousResearch/DeepHermes-3-Llama-3-8B-Preview model card, other models also react to it and start mimicking reasoning, some better, some worse. I've seen internal monologue and self-questioning.
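If you want to try this yourself, here is a minimal sketch using the transformers chat-template API. The model name is just an example, and the system prompt is a placeholder: paste the actual reasoning prompt from the DeepHermes-3 model card.

```python
# Minimal sketch: apply a reasoning-style system prompt to another chat model.
# The model id below is only an example; the prompt string is a placeholder for
# the system prompt published on the DeepHermes-3-Llama-3-8B-Preview model card.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-3.1-8B-Instruct"  # any instruct-tuned model to test
reasoning_system_prompt = "<paste the DeepHermes-3 reasoning system prompt here>"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [
    {"role": "system", "content": reasoning_system_prompt},
    {"role": "user", "content": "How many weighings do I need to find the odd coin among 12?"},
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Leave room for the (often long) internal monologue before the final answer.
outputs = model.generate(inputs, max_new_tokens=1024, do_sample=False)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```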
Evaluating Long Context #2: SCROLLS and ZeroSCROLLS
In this series of posts tracing the history of long context evaluation, we started with Long Range Arena (LRA). Introduced in 2020, LRA is one of the earliest benchmarks designed to tackle the challenge of long context evaluation, but it wasn't built to evaluate LLMs; rather, it targeted the transformer architecture in general.
The SCROLLS benchmark, introduced in 2022, addresses this gap in NLP/LLM research. SCROLLS challenges models with tasks that require reasoning over extended sequences (by 2022 standards). So, what does it offer?
1️⃣ Long Text Focus: Unlike LRA, SCROLLS focuses mainly on text and contains inputs with thousands of words, testing models' ability to synthesize information across lengthy documents.
2️⃣ Diverse Tasks: Includes summarization, question answering, and natural language inference across domains like literature, science, and business.
3️⃣ Unified Format: All datasets are available in a text-to-text format, facilitating easy evaluation and comparison of models (see the loading sketch below).
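As an illustration of that unified text-to-text format, here is a rough sketch of loading one SCROLLS task with the Hugging Face datasets library. The dataset path, config name, and field names ("tau/scrolls", "qasper", "input"/"output") are my assumptions from the public hub release, so double-check them against the SCROLLS repository.

```python
# Sketch: peek at the unified text-to-text format of a SCROLLS task.
# Dataset path, config, and field names are assumptions; verify against the SCROLLS release.
from datasets import load_dataset

ds = load_dataset("tau/scrolls", "qasper", split="validation")

example = ds[0]
print(example["input"][:500])   # long document + question, a single input string
print(example["output"])        # the reference answer, a single target string
```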
Building on SCROLLS, ZeroSCROLLS takes long text evaluation to the next level by focusing on zero-shot learning. Other features include:
1️⃣ New Tasks: Introduces tasks like sentiment aggregation and sorting book chapter summaries (see the loading sketch below).
2️⃣ Leaderboard: A live leaderboard encourages continuous improvement and competition among researchers.
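To see what one of those new zero-shot tasks looks like, here is a hedged sketch along the same lines as above. The dataset path, config name, split, and field name ("tau/zero_scrolls", "space_digest", "test", "input") are assumptions; check them against the ZeroSCROLLS paper and leaderboard page.

```python
# Sketch: inspect one of the new ZeroSCROLLS tasks in its zero-shot format.
# Dataset path, config, split, and field names are assumptions.
from datasets import load_dataset

ds = load_dataset("tau/zero_scrolls", "space_digest", split="test")

example = ds[0]
# Each example should already be a fully formatted zero-shot prompt: the task
# instructions, the long input (here, a batch of reviews to aggregate), and the question.
print(example["input"][:800])
```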
What are some other landmark benchmarks in the history of long context evaluation? Feel free to share your thoughts and suggestions in the comments.
'Can it run DeepSeek V3 671B?' is the new 'Can it run Doom?'.
How minimalist can I go with on-device AI and behemoth models? Here I'm running the DeepSeek V3 MoE on a single A6000 GPU.
Not great, not terrible for such a minimalist setup. I love Mixture-of-Experts architectures. Typically I run my core LLM distributed across four GPUs.
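For the curious, here is a rough sketch of the kind of setup I mean, using llama-cpp-python with a heavily quantized GGUF and only a handful of layers offloaded to the GPU. The model path is a placeholder, and the exact layer count that fits in 48 GB of VRAM depends on the quantization you pick.

```python
# Rough sketch of a minimalist single-GPU setup with llama-cpp-python.
# The GGUF path is a placeholder: you need a heavily quantized DeepSeek V3 GGUF,
# and most of the 671B weights will stay memory-mapped in system RAM.
from llama_cpp import Llama

llm = Llama(
    model_path="/models/deepseek-v3-q2_k.gguf",  # placeholder filename
    n_gpu_layers=8,    # offload only what fits in the A6000's 48 GB of VRAM
    n_ctx=4096,        # keep the context small; the KV cache eats VRAM fast
    n_threads=32,      # MoE layers running on CPU benefit from many threads
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize the Mixture-of-Experts idea in two sentences."}],
    max_tokens=256,
)
print(out["choices"][0]["message"]["content"])
```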
Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.