Collections of ICLR 2026 paper: "OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models"
Zekun Qi
qizekun
AI & ML interests
Embodied Intelligence, Large Langugae Model, 3D Computer Vision
Recent Activity
authored a paper 1 day ago
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model authored a paper 1 day ago
ReWorld: Multi-Dimensional Reward Modeling for Embodied World Models