StableVLA: Towards Robust Vision-Language-Action Models without Extra Data Paper • 2605.18287 • Published 25 days ago • 15
MixSD: Mixed Contextual Self-Distillation for Knowledge Injection Paper • 2605.16865 • Published 27 days ago • 8
VersaViT: Enhancing MLLM Vision Backbones via Task-Guided Optimization Paper • 2602.09934 • Published Feb 10 • 1
Flash-WAM: Modality-Aware Distillation for World Action Models Paper • 2606.05254 • Published 9 days ago • 7
Discrete-WAM: Unified Discrete Vision-Action Token Editing for World-Policy Learning Paper • 2606.05645 • Published 8 days ago • 2
Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation? Paper • 2606.04811 • Published 8 days ago • 16
Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation Paper • 2606.02684 • Published 11 days ago • 16
GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors Paper • 2606.05160 • Published 9 days ago • 8
Unlocking Feature Learning in Gated Delta Networks at Scale Paper • 2606.04048 • Published 10 days ago • 2
SpatialAct: Probing Spatial Reasoning-to-Action Capabilities of VLM Agents in 3D Scenes Paper • 2605.31148 • Published 14 days ago • 3
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 11 days ago • 228
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 15 days ago • 140
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 17 days ago • 139
World Pilot: Steering Vision-Language-Action Models with World-Action Priors Paper • 2606.12403 • Published 2 days ago • 22
Reason, Then Re-reason: Cross-view Revisiting Improves Spatial Reasoning Paper • 2606.11683 • Published 2 days ago • 28
On Subquadratic Architectures: From Applications to Principles Paper • 2606.12364 • Published 1 day ago • 21
Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models Paper • 2606.11324 • Published 3 days ago • 8
World Model Self-Distillation: Training World Models to Solve General Tasks Paper • 2606.12072 • Published 2 days ago • 6