Bio

I obtained my Ph.D. degree from the School of Science and Engineering at The Chinese University of Hong Kong, Shenzhen (CUHK-SZ), where I was fortunate to be advised by Prof. Rui Huang, Dr. Tao Mei, and Prof. Chang-Wen Chen. Previously, I worked as a researcher and postdoctoral fellow at Sangfor Technologies and the Chinese Academy of Sciences, where I participated in building China's first cybersecurity LLM, Sangfor Security GPT, and the CoStrict AI coding agent.
Recently, I joined the School of Artificial Intelligence at Shenzhen University as an Assistant Professor. My research areas cover large models and agents, computer vision, vision-language multimodal learning, and video understanding, with applications in multimedia content analysis, medical AI, and AI4Science. My aim is to solve real-world problems with advanced AI technology.

I am recruiting master's students for 2026 enrollment [See details]. Any student interested in my research is welcome to contact me, and I am open to research collaboration, so feel free to reach out!

News

  • [Always~] I am looking for self-motivated undergraduate and graduate students. Feel free to contact me for research guidance or collaboration.
  • [2026.03] One paper, "Boosting Knowledge-based Visual Question Answering with Structured Context Reasoning", was accepted to ICME 2026.
  • [2026.03] I joined the School of Artificial Intelligence at Shenzhen University as an Assistant Professor.
  • [2025.11] One paper, "Appearance-Motion Decomposed Alignment for Text-Video Retrieval", was accepted to AAAI 2026.
  • [2023.07] I obtained my Ph.D. degree from CUHK-SZ.

Research Interests

  • Research areas: Large models and agents, computer vision, vision-language multimodal learning, video understanding, etc.
  • Application scenarios: Multimedia content analysis, medical AI, and AI4Science, aiming to solve real-world problems with advanced AI technology.

Education & Experience

Work 2026.3 – Present
Assistant Professor
School of Artificial Intelligence, Shenzhen University
Work 2023.7 – 2026.3
Researcher & Postdoctoral Fellow
Sangfor Technologies & Chinese Academy of Sciences, Shenzhen
Education 2018.8 – 2023.7
Ph.D. in Computer and Information Engineering
The Chinese University of Hong Kong, Shenzhen (CUHK-SZ)
Joint Ph.D. program with JD.com
Overlapping experience
Intern 2022.12 – 2023.2
Research Intern
International Digital Economy Academy (IDEA), Shenzhen
Intern 2020.8 – 2022.7
Research Intern (Star Intern Award)
Computer Vision and Multimedia Lab, JD Explore Academy, Beijing
Work 2015.7 – 2018.7
Software Engineer
Shenzhen Da-Jiang Innovations Sciences and Technologies Ltd. (DJI), Shenzhen
Education 2011.8 – 2015.6
B.Eng. in Automation
Xi'an Jiaotong University (XJTU)
GPA top 5%
Overlapping experience
Intern 2013.8 – 2015.5
Research Assistant
Systems Engineering Institute, Xi'an Jiaotong University (XJTU)

Selected Publications [Google Scholar]

End-to-End Video Scene Graph Generation with Temporal Propagation Transformer

Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, and Chang-Wen Chen

In IEEE Transactions on Multimedia (TMM), 2023

PDF

Learning to Generate Language-supervised and Open-vocabulary Scene Graph using Pre-trained Visual-Semantic Space

Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, and Chang-Wen Chen

In CVPR, 2023

PDF Code

Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection

Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, and Chang-Wen Chen

In CVPR, 2022

PDF Code

Boosting Scene Graph Generation with Visual Relation Saliency

Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, and Chang-Wen Chen

In ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2023

PDF

Performance Analysis of Touch-Interaction Behavior for Active Smartphone Authentication

Chao Shen, Yong Zhang, Xiaohong Guan, and Roy A. Maxion

In IEEE Transactions on Information Forensics and Security (TIFS), 2015

PDF

Touch-Interaction Behavior for Continuous User Authentication on Smartphones

Chao Shen, Yong Zhang, Zhongmin Cai, Tianwen Yu, and Xiaohong Guan

In International Conference on Biometrics (ICB), 2015

PDF