I am currently an Associate Researcher at the School of Information Science and Technology, University of Science and Technology of China (USTC). I am a member of the Future Media Computing Lab (未来媒体计算实验室), led by Prof. Xiaojun Chang (常晓军) and Prof. Xun Yang (杨勋).
I completed my postdoctoral researcher job from July 2023 to June 2025 at the School of Information Science and Technology, University of Science and Technology of China, in Hefei, China, under the supervision of Prof. Xun Yang (杨勋). I obtained my bachelor’s and doctoral degrees from the School of Computer Science and Information Engineering, Hefei University of Technology. During my doctoral studies, I was supervised by Prof. Meng Wang (汪萌) and Prof. Dan Guo (郭丹).
My research centers on multimodal intelligence, with a focus on perception, reasoning, and generation across vision, language, and audio modalities. I am particularly interested in how machines can perceive, interpret, and respond to human emotions in a multimodal context. My work spans topics such as cross-modal retrieval, visual captioning, and visual emotion reasoning, with the long-term goal of building empathetic and human-centric AI systems. If you are seeking any form of academic cooperation, please feel free to contact me.
🔥 News
- 2025.08: 🎉One paper is accepted by EMNLP Findings!
- 2025.07: 🎉Three paper are accepted by ACM MM!
- 2025.05: 🎉One paper is accepted by IEEE TMM!
- 2025.02: 🎉One paper is accepted by ACM TIST!
- 2025.01: 🎉Two papers are accepted by IEEE TMM and IEEE TCSVT!
📝 Selected Publications
- Beyond emotion recognition: A multi-turn multimodal emotion understanding and reasoning benchmark, Jinpeng Hu, Hongchang Shi, Chongyuan Dai, Zhuo Li, Peipei Song, Meng Wang, ACM MM 2025
- Video Corpus Moment Retrieval With Query-Specific Context Learning And Progressive Localization, Long Zhang, Peipei Song*, Zhangling Duan, Shuo Wang, Xiaojun Chang, Xun Yang, IEEE TCSVT 2025
- Towards Efficient Partially Relevant Video Retrieval With Active Moment Discovering, Peipei Song, Long Zhang, Long Lan, Weidong Chen, Dan Guo, Xun Yang, Meng Wang, IEEE TMM 2025
- Emotional Video Captioning With Vision-Based Emotion Interpretation Network, Peipei Song, Dan Guo, Xun Yang, Shengeng Tang, Meng Wang, IEEE TIP 2024
- Emotion-Prior Awareness Network For Emotional Video Captioning, Peipei Song, Dan Guo, Xun Yang, Shengeng Tang, Erkun Yang, Meng Wang, ACM MM 2023
- Contextual Attention Network For Emotional Video Captioning, Peipei Song, Dan Guo, Jun Cheng, Meng Wang, IEEE TMM 2022
- Memorial Gan With Joint Semantic Optimization For Unpaired Image Captioning, Peipei Song, Dan Guo, Jinxing Zhou, Mingliang Xu, Meng Wang, IEEE TCYB 2022
- Recurrent Relational Memory Network For Unsupervised Image Captioning, Dan Guo, Yang Wang, Peipei Song*, Meng Wang, IJCAI 2020
🎖 Funds
- 2025.01–2027.12, Young Scientists Fund (C Class), National Natural Science Foundation of China.
- 2025.01–2026.06, General Program, China Postdoctoral Science Foundation.
📖 Experience
- 2025.07–Now, Associate Researcher, School of Information Science and Technology, University of Science and Technology of China, Hefei, China.
- 2023.07–2025.06, Postdoc, School of Information Science and Technology, University of Science and Technology of China, Hefei, China.
- 2017.09–2023.06, PhD, School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, China.
- 2013.09–2017.06, Undergraduate, School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, China.
💻 Services
- Reviewer for ICCV (2025)
- Reviewer for ACM MM (2024/2025)
- Reviewer for AAAI (2022/2023/2024/2025/2026)
- Reviewer for IEEE TMM
- Reviewer for IEEE TNNLS
- Reviewer for IEEE TCSVT
- Reviewer for ACM TOMM
- Reviewer for Pattern Recognition
- Reviewer for Neurocomputing
- …