nvidia
/

Qwen2.5-VL-7B-Instruct-FP8

Text Generation

Model Optimizer

Model card Files Files and versions

Resources

View closed (0)

How to quantize VL models to NVFP4

#2 opened 11 days ago by

"Unsupported position embedding type: default" when loading in TensorRT-LLM

#1 opened about 1 month ago by

EndlessnessSoul