Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published 11 days ago • 48
VI-Net: Boosting Category-level 6D Object Pose Estimation via Learning Decoupled Rotations on the Spherical Representations Paper • 2308.09916 • Published Aug 19, 2023
MG-Nav: Dual-Scale Visual Navigation via Sparse Spatial Memory Paper • 2511.22609 • Published Nov 27, 2025 • 48
SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation Paper • 2311.15707 • Published Nov 27, 2023
DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency Paper • 2103.06526 • Published Mar 11, 2021
Category-Level 6D Object Pose and Size Estimation using Self-Supervised Deep Prior Deformation Networks Paper • 2207.05444 • Published Jul 12, 2022
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17, 2025 • 89
EmbRACE-3K: Embodied Reasoning and Action in Complex Environments Paper • 2507.10548 • Published Jul 14, 2025 • 36
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning Paper • 2506.08889 • Published Jun 10, 2025 • 23