Intel/Llama-3.3-70B-Instruct-AutoRound-Recipe
Updated
A collection of low precision models generated by Intel Neural Compressor including mxfp8, mxfp4 and nvfp4.
Totally Free + Zero Barriers + No Login Required