Junming Liu

I am a third-year Master’s student in Computer Science at Tongji University and a Research Intern at the Shanghai AI Lab. My research primarily focuses on Generative Intelligence, Multimodal Reasoning, and Graph Theory.

My work seeks to enhance the creative fidelity and cognitive depth of AI systems while ensuring logical consistency through knowledge representations. Recently, my research has centered on the following areas:

Diffusion Models for General and Medical Domains.
Memory-Augmented Agents.
Post-training of Multimodal Large Language Models for Spatial Cognition.

I am actively seeking PhD opportunities starting Fall 2026. I would be thrilled to work with prospective advisors and research groups. Please feel free to contact me.

News

Jan, 2026	Our paper DMM has been accepted by ICASSP 2026! 🎉🎉
Jan, 2026	Our paper AMID has been accepted by WWW 2026! 🎉🎉
Nov, 2025	Our paper ReBrain has been accepted by WACV 2026! 🎉🎉 Code available at Link. 🤗🤗
Aug, 2025	Our paper COGO has been accepted by PRCV 2025! 🎉🎉
Jul, 2025	Our paper HM-RAG has been accepted by ACM MM 2025! 🎉🎉 Code available at Link. 🤗🤗
Jun, 2025	Our paper VaLiK has been accepted by ICCV 2025! 🎉🎉 Code available at Link. 🤗🤗
Jan, 2025	Joined Shanghai AI Lab as a Research Intern, focusing on Knowledge Reasoning! ⚡️⚡️

Selected Publications

WWW

AMID: Model-Agnostic Dataset Distillation by Adversarial Mutual Information Minimization

Aoqi Wu, Junming Liu, Yuwei Zhang, Weiquan Huang, Liang Hu†, Yifan Yang, Qi Zhang, Jiaxing Miao, Yuhan Tang, and Zhongyuan Lai

Proceedings of the ACM on Web Conference, 2026

PDF

ACM MM

HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation

Pei Liu, Xin Liu, Ruoyu Yao, Junming Liu, Siyuan Meng, Ding Wang†, and Jun Ma†

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

PDF Code

ICCV

Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning

Junming Liu, Siyuan Meng, Yanting Gao, Song Mao, Pinlong Cai, Guohang Yan, Yirong Chen, Zilin Bian, Ding Wang†, and Botian Shi

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

PDF Code