Physics filtering favors the generalization of robot learning

1School of Mechanical and Aerospace Engineering, Nanyang Technological University, 50 Nanyang Avenue, 639798, Singapore. 2School of Electrical and Electronic Engineering, Nanyang Technological University, 50 Nanyang Avenue, 639798, Singapore. 3School of Automation Science and Electrical Engineering, Beihang University, Xueyuan Road, Beijing, 100191, China. 4School of Aeronautic Science and Engineering, Beihang University, Xueyuan Road, Beijing, 100191, China. 5China Aerospace Science and Technology Corporation, Fucheng Road, Beijing, 100048, China.
Indicates Equal Contribution

Abstract

Living organisms exhibit extraordinary adaptability to unseen environments through their intrinsic physical structures and lifelong feedback-driven learning. Endowing robots with comparable generalization is critical for reliable operation in the real world. While recent approaches attempt to improve generalization by scaling training data, such strategies remain impractical for robotics, where collecting real-world demonstrations at the scale of large language models is prohibitively costly and slow. Contrary to this reliance on massive datasets, we show that robots can generalize effectively even with limited training data by leveraging a feedback mechanism, namely PhyFilter, that corrects learning outputs with physics-filtered learning residuals. PhyFilter operates as a lightweight, model-agnostic module whose parameters can be automatically optimized through an auto-learning algorithm, eliminating manual tuning and enabling seamless integration with diverse robot policies. We validate PhyFilter across four representative robotic systems, demonstrating that it enables quadruped robots to generalize to unseen terrains, payload variations, and speed ranges; drones to flight under unseen wind disturbances; aerial manipulators to achieve centimeter-level in-air capture despite wind and mass uncertainties; and acceleration differentiators to remain robust with distribution shift. These results reveal a scalable and data-efficient pathway toward generalizable robot learning, showing that physics-filtered feedback can serve as a powerful alternative to massive data scaling.

Quadruped Experiments

Quadruped Simulations

Drone Experiments

Aerial Manipulation Experiments

Contact

If you have any questions, please contact us. jdjiabuaa@126.com; mwangbuaa@126.com