Hi! I am Haoxuan Xu (Harrison, 徐浩轩).
I am an MPhil student in System Hub/ROAS Thrust at the Hong Kong University of Science and Technology (Guangzhou), advised by Prof. Haoang Li. Previously, I earned my undergraduate degree from the School of Information Science and Engineering at Shandong University (Chongxin College), advised by Prof. Yang Yang.
🎓 Education
💼 Experience
🚀 Research Interests
- Mobile Manipulation
- Vision-and-Language Navigation
- Computer Vision
My research interests lie in Vision-and-Language Navigation (VLN) and Computer Vision, with a focus on embodied AI for service robotics. Currently, I work on bridging advanced machine learning techniques with real-world applications, particularly in developing adaptive navigation systems that interpret natural language instructions and dynamic environments.
I am always open to discussions and collaborations. If my work interests you, feel free to reach out at hxu095 [at] connect.hkust-gz.edu.cn
📝 Publications († denotes equal contribution)

P3Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation
ArXiv Preprint
Tianfu Li†, Wenbo Chen†, Haoxuan Xu†, et al.
- Unified perception, prediction, and planning in a single VLN network, using intermediate modules to sharpen scene understanding and boost navigation accuracy.

Cross-domain Car Detection Model with Integrated Convolutional Block Attention Mechanism
Image and Vision Computing (JCR Q1, IF:4.7, CCF-C)
Haoxuan Xu†, Songning Lai†, Yang Yang~
- Proposed a complete cross-domain detection framework with an integrated CBAM architecture and GIoU loss optimization.

International Journal of Disaster Risk Science (JCR Q1 (IF: 5.0))
Mengfan Shen, Haoxuan Xu, Hongbing Liu and Ziqiang Han~
- Applied topic modeling and sentiment analysis to Weibo posts, identifying key themes and public emotions during international disaster response.

Multimodal Sentiment Analysis: A Survey
Displays (JCR Q1 (IF: 4.3))
Songning Lai, Xifeng Hu, Haoxuan Xu, Zhaoxia Ren~ and Zhi Liu~
- Provides a comprehensive overview of multimodal sentiment analysis, covering its history, datasets, advanced models, and future prospects.

MG-KG: Unsupervised video anomaly detection based on motion guidance and knowledge graph
Image and Vision Computing (JCR Q1, IF:4.7, CCF-C)
Qiyue Sun, Yang Yang, Haoxuan Xu, Zezhou Li, Yunxia Liu and Hongjun Wang~
- Addresses spatio-temporal linkage and interpretability in VAD by unifying motion-guided prediction with knowledge-graph retrieval.
🔭 Projects

Research and Development of Embodied AI-based Multi-terrain Service Robot
- Used ConceptGraphs for open-vocabulary scene mapping and CLIP/GPT-4 for object retrieval.
- Implemented optimized A* and KD-Tree for path planning.
- Deployed on a Songling chassis for sim-to-real transfer.
🎖 Honors and Awards
- Postgraduate Studentship (PGS) Award, HKUST(GZ)
- First Prize, National College Student Mathematical Modeling Competition (Shandong Province)
- Second Prize, 14th National College Student Mathematics Competition
- Outstanding Graduate, Shandong University