Welcome to the Group of Multimedia (MM) @ College of Intelligence and Computing, Tianjin University, China. Our research has been mainly in the basic theories, efficient algorithms and application of artificial intelligence and machine learning. We mainly focus on cross-media perception, understanding and reasoning, as well as robust intelligent learning models. Aiming at the semantic gap in multimedia data, we explore visual and semantic understanding of image, video, and text data. Our research interests also include the adversarial attack and robustness improvement of deep neural networks on data containing natural noises or adversarial noises to cope with low generalization ability and high vulnerability of machine learning methods under noisy scenarios. Recently, we are revolving around intelligent learning and games for open domains, including machine learning aided by “cloud-terminal” cooperation, privacy-preserving domain adaptation and adversarial games. The central goal of our researches is to improve the interpretability and robustness of deep learning models based on the theoretical basis of cross-media understanding and adversarial machine learning, so as to build more reliable multimedia and intelligent learning systems.
We have published papers on leading journals and conferences of multimedia, computer vision, machine learning, and artificial intelligence, such as IEEE TPAMI, IEEE TIP, IEEE TKDE, IEEE TNNLS, IEEE TMM, IEEE TCSVT, IEEE TCYB, ACM MM, CVPR, ICCV, ECCV, NeurIPS, AAAI, IJCAI, etc. Our paper was awarded “Best Paper Finalist” of ACM Multimedia 2017. We also got winner records in main technical challenges such as the Champion of the Large Scale Movie Description Challenge (LSMDC 2017, joint with ICCV 2017) and the Runner-up of the 2nd MSR Large-Scale Video to Language Challenge (Honorable Mention Award of Grand Challenge @ ACM MM 2017).
We are looking for passionate new PhD students and Master students to join the team !
If you are interested, please contact Prof. Yahong Han
Undergraduate courses:
Media Computing
The paper "WiViPose: A Video-aided Wi-Fi Framework for Environment-Independent 3D Human Pose Estimation" was accepted by IEEE TMM.
3. December 2024Two papers about saliency object detection were accepted by IEEE TCSVT.
13. November 2024Deng Li's paper "Visual-Language Pre-training Based on Multi-entity Alignment" was accepted by Journal of Software (软件学报).
4. August 2024Jie Wang's paper "Progressive Expansion for Semi-supervised Bi-modal Salient Object Detection" was accepted by Pattern Recognition.
1. June 2024Nana Yu's paper "Degradation-removed Multiscale Fusion for Low-Light Salient Object Detection" was accepted by Pattern Recognition.
26. March 2024Mingshi's paper "Behavior-Contextualized Item Preference Network for Multi-Behavior Recommendation" was accepted by SIGIR 2024 (CCF-A).
06. March 2024Runhua's paper "Generalizing to Out-of-Sample Degradations via Model Reprogramming" was accepted by IEEE TIP (CCF-A).