Running 1776 with draft model

by ernestr - opened 1 day ago

1 day ago

Thanks for the quant @matatonic ! I'm currently running it with Llama3.2 as a draft model. It's working well but I'm curious if using a draft model degrades the thinking.

ernestr changed discussion status to closed 1 day ago

matatonic

Owner 1 day ago

It doesn't degrade the thinking, I use the 1B 3.2 as draft also, worst case it doesn't make it faster.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment