Welcome to the MM Group

Welcome to the Group of Multimedia (MM) @ College of Intelligence and Computing, Tianjin University, China. Our research has been mainly in the basic theories, efficient algorithms and application of artificial intelligence and machine learning. We mainly focus on cross-media perception, understanding and reasoning, as well as robust intelligent learning models. Aiming at the semantic gap in multimedia data, we explore visual and semantic understanding of image, video, and text data. Our research interests also include the adversarial attack and robustness improvement of deep neural networks on data containing natural noises or adversarial noises to cope with low generalization ability and high vulnerability of machine learning methods under noisy scenarios. We are revolving around intelligent learning and games for open domains, including machine learning aided by “cloud-terminal” cooperation, privacy-preserving domain adaptation and adversarial games. Recently, we are actively exploring Embodied Intelligence, aiming to integrate perception, learning, and decision-making to enhance adaptability and robustness in real-world applications. The central goal of our researches is to improve the interpretability and robustness of deep learning models based on the theoretical basis of cross-media understanding and adversarial machine learning, so as to build more reliable multimedia and intelligent learning systems.

We have published papers on leading journals and conferences of multimedia, computer vision, machine learning, and artificial intelligence, such as IEEE TPAMI, IEEE TIP, IEEE TKDE, IEEE TNNLS, IEEE TMM, IEEE TCSVT, IEEE TCYB, ACM MM, CVPR, ICCV, ECCV, NeurIPS, AAAI, IJCAI, etc. A PhD student was awarded the China Society of Image and Graphics (CSIG) Outstanding Dissertation 2021, and two Master students were selected for the Tencent Rhino-Bird Elite Talent Training Program in 2018 and 2020, respectively. Our team achieved third place overall in the Untargeted and Targeted Attack Tracks of the NeurIPS 2018 Adversarial Vision Challenge. Our paper was awarded “Best Paper Finalist” of ACM Multimedia 2017. We also got winner records in main technical challenges such as the Champion of the Large Scale Movie Description Challenge (LSMDC 2017, joint with ICCV 2017) and the Runner-up of the 2nd MSR Large-Scale Video to Language Challenge (Honorable Mention Award of Grand Challenge @ ACM MM 2017).

We are looking for passionate new PhD students and Master students to join the team !
If you are interested, please contact Prof. Yahong Han

Undergraduate courses:
Media Computing

News

19. July 2025

Mingshi's paper "Latent Factor Modeling with Expert Network for Multi-Behavior Recommendation" was accepted by IEEE TKDE (CCF A).

5. July 2025

Xu Chen's paper "Ex Pede Herculem, Predicting Global Actionness Curve from Local Clips" was accepted by ACM Multimedia 2025 (CCF-A).

26. June 2025

Three papers were accepted by ICCV 2025 (CCF-A).

5. April 2025

Zihao's CVPR accpeted paper "Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection" was selected as CVPR 2025 Highlights (rate 13.5%).

28. March 2025

Mingshi Yan's paper "User Invariant Preference Learning for Multi-Behavior Recommendation" was accepted by ACM TOIS (CCF-A).

25. March 2025

Nana Yu's paper "Semantic Prompt Enhancement for Semi-Supervised Low-Light Salient Object Detection" was accepted by IEEE TNNLS.

27. February 2025

Two papers were accepted by CVPR 2025 (CCF-A).

Welcome to the MM Group

News

... see all News