I am currently a final-year undergraduate student at Beijing Institute of Technology (BIT), and will begin my Ph.D. at The University of Hong Kong (HKU) in Fall 2026. I am also a Visiting Research Student at the Big Data Institute (BDI), The Hong Kong University of Science and Technology (Guangzhou) (HKUST GZ), supervised by Prof. Yongqi Zhang.
My research focuses on Video Understanding, World Models, and Agentic Multimodal AI. I am particularly interested in enabling multimodal agents to imagine future states before answering or acting, by integrating predictive world models into multimodal reasoning and decision-making systems.
I am currently a Research Intern at Tencent Yuanbao (CSIG), working on Multimodal RAG and agentic multimodal reasoning.
If you have internship opportunities related to Video Understanding & Generation, please feel free to reach out to me via email!
Email: enjundu.cs@gmail.com
The Hong Kong University of Science and Technology
Visiting Student
Beijing Institute of Technology
Bachelor of Cyberspace Science and Technology
My research lies at the intersection of Video Understanding, World Models, and Agentic Multimodal AI. Concretely, I study how world models can serve as internal simulators for multimodal agents, enabling them to mentally roll out future visual trajectories, infer hidden dynamics, and evaluate potential actions before committing to a response or decision. My research spans video understanding and reasoning, predictive video generation, world model learning, and agentic multimodal systems. My long-term vision is to make imagination a first-class component of multimodal intelligence, enabling agents that are not only more capable, but also more faithful, grounded, and compute-efficient—reliable enough to be trusted in long-horizon real-world decision making under uncertainty.
Here are some of my research works.

Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.
May 8, 2026

Participated in the final defense of the Xu Teli Scholarship, the highest honorary scholarship at Beijing Institute of Technology (BIT), awarded with a scholarship prize of RMB 50,000.
Apr 19, 2026

Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.
Mar 8, 2026

Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.
Feb 8, 2026

Oral presentation at AAAI 2026 on our paper "GraphOracle: Efficient Fully-Inductive Knowledge Graph Reasoning via Relation-Dependency Graphs".
Jan 25, 2026

Oral presentation at EMNLP 2025 on our paper "Mixture of Length and Pruning Experts for Knowledge Graphs Reasoning".
Nov 5, 2025
Enjun Du, Xinyu Zuo, Lisheng Duan, Yongqi Zhang
Enjun Du, Xinyu Zuo, Lisheng Duan, Ruiwen Tao, Yongqi Zhang
Yongqi Zhang, Enjun Du, Siyi Liu
Siyi Liu, Enjun Du, Zirong Chen, Xinyu Zuo, Lisheng Duan, Yongqi Zhang
Zirong Chen, Fuda Ye, Enjun Du, Junfu Pu, Xinyu Zuo, Yongqi Zhang
Wenhao Xue, Zhida Qin, Haoyao Zhang, Shixiao Yang, Enjun Du, Xingbo Tian, Shuang Li, Tianyu Huang
Chinese National Invention Patent, CN119180288A, published December 2024.
I genuinely believe that meaningful progress in academia stems from open dialogue and thoughtful debate. If you have any questions about my research–or if you’ve previously contacted me through GitHub issues and haven’t received a response–please feel free to reach out via email. I’m always happy to chat, collaborate, or offer assistance where I can.
Throughout my academic journey, I’ve been fortunate to receive support and inspiration from many generous people. I’m always eager to give back to the community and engage with others passionate about learning and discovery.
Preferred Email:
Optional Email:
Please avoid contacting me at enjun_du@bit.edu.cn, as I will no longer have access to this address.