OpenVINO/Llama-3.1-8B-Instruct-FastDraft-150M-int8-ov
Updated
•
892
•
9
Collection of OpenVINO optimized efficient draft models for speculative decoding
Totally Free + Zero Barriers + No Login Required