Vision-Language-Action · Physical Intelligence
π0 Explained: A Vision-Language-Action Flow Model for Robots
π0 bolts a flow-matching action expert onto a pretrained VLM, emitting ~50Hz action chunks so one policy can fold laundry, bus tables, and assemble boxes across single-arm, dual-arm, and mobile robots.