nm-testing/SpeculatorLlama3-1-8B-Eagle3-sgl
nm-testing/Mockup-qwen235-eagle3-fp16-sgl
Updated
•
10
nm-testing/Speculator-Qwen3-8B-Eagle3-sgl
nm-testing/Qwen3-VL-235B-A22B-Instruct-NVFP4
Updated
nm-testing/Mockup-qwen235-eagle3-fp16-speculators-converted
nm-testing/testing-llama3.1.8b-2layer-eagle3
Updated
nm-testing/Llama-3.1-70B-Instruct-FP8-block
Text Generation
•
Updated
nm-testing/Qwen3-235B-A22B-EAGLE3-converted-speculators-lmsys
1B
•
Updated
•
9
nm-testing/Meta-Llama-3-8B-Instruct-attention-fp8
8B
•
Updated
•
6
nm-testing/Qwen2.5-VL-7B-Instruct-INT8_dyn_per_token
8B
•
Updated
•
5
nm-testing/Speculator-Qwen3-8B-Eagle3-converted-071-quantized-w4a16
1B
•
Updated
•
6.68k
nm-testing/llama-3.3-70b-speculators-eagle3
2B
•
Updated
•
8
nm-testing/Apertus-70B-Instruct-2509-quantized.w8a8.damp01.sq08
71B
•
Updated
•
9
nm-testing/Qwen3-30B-A3B-NVFP4-working
17B
•
Updated
•
7
nm-testing/Meta-Llama-3-8B-Instruct-selfattn-w8a8-mlp-w4a16-sequential
3B
•
Updated
•
7
nm-testing/for_testing_gptoss20b_spec_eagle3
0.8B
•
Updated
•
9
nm-testing/Llama-4-Maverick-17B-128E-Instruct-FP8-BLOCK
401B
•
Updated
•
6
nm-testing/Apertus-70B-Instruct-2509-NVFP4
41B
•
Updated
•
4
nm-testing/Apertus-8B-Instruct-2509-NVFP4
5B
•
Updated
•
8
nm-testing/Llama-4-Scout-17B-16E-Instruct-FP8-BLOCK
108B
•
Updated
•
5
nm-testing/tinysmokellama-3.2
354k
•
Updated
•
34.9k
nm-testing/Qwen3-Next-80B-A3B-Instruct-NVFP4
Updated
•
429
•
2
nm-testing/Llama-3.2-1B-Instruct-quip-w4a16
0.8B
•
Updated
•
2.21k
nm-testing/Llama-3.2-1B-Instruct-group-activations
1B
•
Updated
•
6
nm-testing/qwen3-80b-fp8-dynamic
80B
•
Updated
•
7
nm-testing/gemma-3-4b-it-s_q-W4A8-G512
5B
•
Updated
•
5
nm-testing/llama3.3-70B-speculators.09-10-2025-eagle3
2B
•
Updated
•
5
nm-testing/Llama-3.2-1B-Instruct-quipv-w4a16
0.7B
•
Updated
•
6
nm-testing/Llama-3.2-1B-Instruct-quip
2B
•
Updated
•
5
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2-online
0.7B
•
Updated
•
22