Xiangyu Yue
Assistant Professor
Multimedia Laboratory (MMLab)
Department of Information Engineering
The Chinese University of Hong Kong
Email: xyyue [at] ie.cuhk.edu.hk
I am an Assistant Professor in the Department of Information Engineering at The Chinese University of Hong Kong, with the Multimedia Laboratory (MMLab). I received my Ph.D. in Electrical Engineering and Computer Science from the University of California, Berkeley, working with Prof. Alberto Sangiovanni Vincentelli and Prof. Kurt Keutzer at Berkeley AI Research.
Prior to Berkeley, I received my M.S. degree from Stanford University and B.S. degree from Nanjing University. I have spent time at Google Research, Google [x] Robotics, Baidu AI Research, and Tencent AI Lab. I received the Lotfi A. Zadeh Award for my research.
Openings
Prospective students and researchers: I have multiple fully funded Ph.D. positions for 2027, as well as Post-Doc, RA, visiting student, and intern opportunities. Please email me if you are interested and highlight any funding sources or support you may already have.
Selected Publications
Twins: Learn to Predict Unified Representations with Focal Loss
International Conference on Machine Learning (ICML), 2026
MVISTA-4D: View-Consistent 4D World Model with Test-Time Action Inference for Robotic Manipulation
International Conference on Machine Learning (ICML), 2026
SpatialLogic-Bench: A Diagnostic Benchmark for Task-Oriented Spatiotemporal Reasoning
AAAI Conference on Artificial Intelligence (AAAI), 2026
StyleDoctor: Towards Specialist Reward Model for Style-centric Generation Tasks
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026
Evolve Vision-Language-Action Model into an Agent with On-the-fly Tool-use
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Findings, 2026
PreciseCache: Precise Feature Caching for Efficient and High-fidelity Video Generation
International Conference on Learning Representations (ICLR), 2026
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
International Conference on Learning Representations (ICLR), 2026
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
International Conference on Learning Representations (ICLR), 2026
MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
International Conference on Learning Representations (ICLR), 2026
Consistent Noisy Latent Rewards for Trajectory Preference Optimization in Diffusion Models
International Conference on Learning Representations (ICLR), 2026
Video-R1: Reinforcing Video Reasoning in MLLMs
NeurIPS 2025 Most Influential Paper Top 10
Advances in Neural Information Processing Systems (NeurIPS), 2025
Chimera: Improving Generalist Model with Domain-Specific Experts
IEEE/CVF International Conference on Computer Vision (ICCV), 2025
CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation
IEEE/CVF International Conference on Computer Vision (ICCV), 2025
FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions
IEEE/CVF International Conference on Computer Vision (ICCV), 2025
Breaking the Encoder Barrier for Seamless Video-Language Understanding
IEEE/CVF International Conference on Computer Vision (ICCV), 2025
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations
Preprint, 2024