nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16_2of4_channel-e2e • 0.3B • 5
nm-testing/TinyLlama-1.1B-Chat-v1.0-sparse2of4_only-e2e • 0.7B • 6
nm-testing/TinyLlama-1.1B-Chat-v1.0-sparse2of4_fp8_dynamic-e2e • 0.7B • 7
nm-testing/TinyLlama-1.1B-Chat-v1.0-kv_cache_default_tinyllama-e2e • 1B • 3
nm-testing/Phi-3-mini-4k-instruct-kv_cache_default_phi3-e2e • 4B • 4
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8_tensor_weight_static_per_tensor_act-e2e • 1B • 3
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8_channel_weight_static_per_tensor-e2e • 1B • 3
nm-testing/Qwen3-30B-A3B-W4A16-first-10-e2e-e2e • 25B
nm-testing/Qwen3-30B-A3B-FP8_DYNAMIC-e2e • 31B
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-e2e • 1B • 102
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8A16_tensor-e2e • 1B • 9
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8A16_channel-e2e • 1B • 6
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8_BLOCK-e2e • 1B • 5
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4A16-e2e • 0.7B • 3
nm-testing/tinysmokeqwen3moe-W4A16-first-only-CTstable • 2.54M • 1.73k
nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head
nm-testing/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Tensor
nm-testing/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Head
nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor
nm-testing/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Tensor
nm-testing/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Head
nm-testing/Qwen3-32B-QKV-Cache-FP8-Per-Tensor
nm-testing/Qwen3-32B-QKV-Cache-FP8-Per-Head
nm-testing/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor
nm-testing/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head
nm-testing/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor
nm-testing/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Head
nm-testing/DeepSeek-R1-Distill-Qwen-32B-NVFP4 • Text Generation • 19B • 1.26k • 1
nm-testing/tinysmokeqwen3moe-W4A16-first-only • 2.54M
nm-testing/tinysmokeqwen3moe • 2.93M