About me

I’m a first-year PhD student at the University of Colorado Boulder in the Image and Video Computing Group, advised by Prof. Danna Gurari. My research focuses on building vision-language models that understand hierarchical object compositions, with applications in visual grounding and accessibility.

Before coming to Boulder, I earned my bachelor’s degree from UC Davis, where I co-founded RoastPic with Prof. William Ristenpart and our wonderful team at the UC Davis Coffee Center. I also had the chance to work with Prof. Jiawei Zhang, who introduced me to research in vision-language modeling and video quality assessment.

Outside of research, I’m really into coffee and the outdoors. I’m a certified Q Arabica Grader and enjoy exploring specialty coffee around the world. If I haven’t replied in a while, I’m probably somewhere in the mountains.

Publications and Technical Reports

Accounting for Focus Ambiguity in Visual Questions
Chongyan Chen*, Yu-Yun Tseng*, Zhuoheng Li*, Anush Venkatesh, Danna Gurari
ICCV 2025
Pathogenic potential prediction of Vibrio parahaemolyticus by using pangenome data with high performance machine learning algorithms
Zhuosheng Liu, Zhuoheng Li, Jiawei Zhang, C Titus Brown, Luxin Wang
bioRxiv Preprint
A Survey of AI-Generated Video Evaluation
Xiao Liu, Xinhao Xiang, Zizhong Li, Yongheng Wang, Zhuoheng Li, Zhuosheng Liu, Weidi Zhang, Weiqi Ye, Jiawei Zhang
arXiv Preprint
Parameter-Efficient Fine-Tuning for Vision-Language Models
Zhuoheng Li, Zhuosheng Liu, Jiawei Zhang
Technical Report
CLIPath: Fine-tune CLIP with Visual Feature Fusion for Pathology Image Analysis Towards Minimizing Data Collection Efforts
Zhengfeng Lai, Zhuoheng Li, Luca Cerny Oliveira, Joohi Chauhan, Brittany N. Dugger, Chen-Nee Chuah
ICCV 2023 Workshop on Computer Vision for Automated Medical Diagnosis

Zhuoheng Li

Publications and Technical Reports