0.6B-Reranker GGUF - Working

by JonathanMiddleton - opened 2 days ago

2 days ago

•

I was unable to find a working 0.6B-Reranker quant so I added a set here: https://huggingface.co/JonathanMiddleton/Qwen3-Reranker-0.6B. These have been verified with ngxson's unmerged llama.cpp branch. Details in the model card.

Thank you so much for these models!

JonathanMiddleton changed discussion title from 0.6B GGUF - Working to 0.6B-Reranker GGUF - Working 2 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment