EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence
Paper
β’
2509.14977
β’
Published
β’
3
Official PyTorch implementation of the model described in
"EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence".
| Item | Value |
|---|---|
| Paper | arXiv:2509.14977 |
| Authors | Chaoyin SheΒΉ, Ruifang LuΒ² |
| Code | GitHub repo |
| Model Hub | Hugging Face |
Reference Qwen2.5-VL-7B-Instruct
If you use this model or code in your research, please cite:
@misc{she2025echovlmdynamicmixtureofexpertsvisionlanguage,
title={EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence},
author={Chaoyin She and Ruifang Lu and Lida Chen and Wei Wang and Qinghua Huang},
year={2025},
eprint={2509.14977},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2509.14977},
}
Base model
lingshu-medical-mllm/Lingshu-7B