About me
I’m a first-year PhD student at CU Boulder in the IVC Group, advised by Prof. Danna Gurari. My research focuses on Computer Vision and Multi-Modal Learning, and I’m working on expand visual grounding to video and 3D spatial spaces to improve accessibility for people with visual impairments.
Before I came to boulder, I co-founded RoastPic with Prof. William Ristenpart and our wonderful team. And I’m very proud that we brought the first Mobile App that able to do roast level analysis of coffee beans. I also had a great time in Davis where Prof. Jiawei Zhang showed me what research was like and also advised me in research in Vision-language models and Video Quality.
Beyond my research, I earned my Q certification in 2020, which recognize the ability to assess quality of specialty coffee worldwide. I’m also pretty boulder sterotype where I spent a lot of time in the mountains hiking and skiing.
Publications and Technical Reports
- Accounting for Focus Ambiguity in Visual Questions
Chongyan Chen*, Yu-Yun Tseng*, Zhuoheng Li*, Anush Venkatesh, Danna Gurari Under Review A Survey of AI-Generated Video Evaluation
Xiao Liu, Xinhao Xiang, Zizhong Li, Yongheng Wang, Zhuoheng Li, Zhuosheng Liu, Weidi Zhang, Weiqi Ye, Jiawei Zhang Under ReviewParameter-Efficient Fine-Tuning for Vision-Language Models
Zhuoheng Li, Zhuosheng Liu, Jiawei Zhang; Technical Report- CLIPath: Fine-tune CLIP with Visual Feature Fusion for Pathology Image Analysis Towards Minimizing Data Collection Efforts
Zhengfeng Lai, Zhuoheng Li, Luca Cerny Oliveira, Joohi Chauhan, Brittany N. Dugger, Chen-Nee Chuah; ICCV 2023 Workshop on Computer Vision for Automated Medical Diagnosis