Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 21 days ago • 49
What matters for Representation Alignment: Global Information or Spatial Structure? Paper • 2512.10794 • Published Dec 11, 2025 • 10