Skip to content

CloudEngineHub/PhyAgentOS

ย 
ย 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

118 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

PhyAgentOS

Physical Agent Operation System (PhyAgentOS)

A Decoupled Protocol-Based Framework for Self-Evolving and Cross-Embodiment Agents

English | ไธญๆ–‡

Version Python License Website

Long Demo

Watch the video

๐Ÿ“– Introduction

Physical Agent Operation System (PhyAgentOS) is a self-evolving embodied AI framework based on Agentic workflows. Moving away from the "black-box" model of traditional "large models directly controlling hardware," PhyAgentOS pioneers a "Cognitive-Physical Decoupling" architectural paradigm. By constructing a Language-Action Interface, it completely decouples action representation from embodiment morphology, enabling standardized mapping from high-reasoning cloud models to edge physical execution layers.

PhyAgentOS utilizes a "State-as-a-File" protocol matrix, natively supporting zero-code migration across hardware platforms, sandbox-driven tool self-generation, and safety correction mechanisms based on Multi-Agent Critic verification.

โœจ Core Features

  • ๐Ÿ“ State-as-a-File: Software and hardware communicate by reading/writing local Markdown files (e.g., ENVIRONMENT.md, ACTION.md), ensuring complete decoupling and extreme transparency.
  • ๐Ÿง  Dual-Track Multi-Agent System:
    • Track A (Cognitive Core): Includes Planner and Critic mechanisms. Large models do not issue commands directly; they must be verified by the Critic against the current robot's runtime EMBODIED.md (copied from profiles) before being committed.
    • Track B (Physical Execution): An independent hardware watchdog (hal_watchdog.py) monitors and executes commands. Supports both single-instance mode and Fleet mode for multi-robot coordination.
  • ๐Ÿ”Œ Dynamic Plugin Mechanism: Supports dynamic loading of external hardware drivers via hal/drivers/, allowing for new hardware support without modifying core code.
  • ๐Ÿ›ก๏ธ Safety Correction Mechanism: Strict action verification and LESSONS.md experience library prevent Agent workflows from going out of control.
  • ๐ŸŽฎ Simulation Loop: Built-in lightweight simulation support allows verification of the full chain from natural language instructions to physical state changes without real hardware.
  • ๐Ÿ—บ๏ธ Semantic Navigation & Perception: Built-in SemanticNavigationTool and PerceptionService support resolving high-level semantic goals into physical coordinates and constructing scene graphs by fusing geometric and semantic information.

๐Ÿฆพ Showcase

rekep
PhyAgentOS deploys robot arms with one click, no coding required (AgileX PIPER).
rekep
PhyAgentOS quickly connects to XLeRobot, automatically checks the robot's status, and after confirming safety, performs basic chassis movement and dual-arm motion.
rekep
PhyAgentOS achieves natural language-driven grasping tasks through SAM3 (AgileX PIPER).
rekep
PhyAgentOS achieves natural language-driven grasping tasks through ReKep (Dobot Nova 2).
rekep
PhyAgentOS achieves realtime dialog and natural language-driven pick&up task through ReKep (Franka Research 3).

๐Ÿ—๏ธ Architecture

PhyAgentOS's core is a local workspace where software and hardware operate as independent daemons reading/writing files:

PhyAgentOS

๐Ÿš€ Quick Start

1. Install Dependencies

git clone https://github.com/PhyAgentOS/PhyAgentOS.git
cd PhyAgentOS
pip install -e .
# Install simulation dependencies (e.g., watchdog)
pip install watchdog

# Optional: Install external ReKep real-world plugin
python scripts/deploy_rekep_real_plugin.py \
  --repo-url https://github.com/baiyu858/PhyAgentOS-rekep-real-plugin.git

2. Initialize Workspace

paos onboard

This generates core Markdown protocol files in the current workspace. Single-instance mode defaults to ~/.PhyAgentOS/workspace/; Fleet mode uses a shared workspace and multiple robot workspaces under ~/.PhyAgentOS/workspaces/.

3. Start the System

Open two terminals:

Terminal 1: Start Hardware Watchdog & Simulation (Track B)

python hal/hal_watchdog.py

To pass driver-specific runtime configuration without specializing the watchdog CLI, use:

python hal/hal_watchdog.py --driver <driver_name> --driver-config path/to/driver.json

The config file must be a JSON object and its keys are passed through to the selected driver constructor unchanged.

To use real-world ReKep instead of simulation, install the plugin and run:

python hal/hal_watchdog.py --driver rekep_real

Terminal 2: Start Brain Agent (Track A)

paos agent

4. Interaction Example

In the paos agent CLI, input:

"Look at what is on the table, then grasp that apple for me."

You will see the action execution in the simulation logs in Terminal 1, and receive completion confirmation from the Agent in Terminal 2.

  1. More Information

User Manual: Operations guide for end users, integrators, and demo operators.

Development Guide: Layered guide for secondary developers, hardware integrators, plugin authors, and maintainers.

๐Ÿ“ Project Structure

Physical Agent Operating System/
โ”œโ”€โ”€ PhyAgentOS/                # Track A: Software Brain Core
โ”‚   โ”œโ”€โ”€ agent/              # Agent Logic (Planner, Critic)
โ”‚   โ”œโ”€โ”€ templates/          # Workspace Markdown Templates
โ”‚   โ””โ”€โ”€ ...
โ”œโ”€โ”€ hal/                    # Track B: Hardware HAL & Simulation
โ”‚   โ”œโ”€โ”€ hal_watchdog.py     # Hardware Watchdog Daemon
โ”‚   โ””โ”€โ”€ simulation/         # Simulation Environment Code
โ”œโ”€โ”€ scripts/                # External HAL Plugin Deployment
โ”‚   โ””โ”€โ”€ deploy_rekep_real_plugin.py
โ”œโ”€โ”€ workspace/              # Single-instance Runtime Workspace
โ”‚   โ”œโ”€โ”€ EMBODIED.md         # Runtime Robot Profile
โ”‚   โ”œโ”€โ”€ ENVIRONMENT.md      # Current Scene-Graph
โ”‚   โ”œโ”€โ”€ ACTION.md           # Pending Action Commands
โ”‚   โ”œโ”€โ”€ LESSONS.md          # Failure Experience Records
โ”‚   โ””โ”€โ”€ SKILL.md            # Successful Workflow SOP
โ”œโ”€โ”€ workspaces/             # Fleet Topology
โ”‚   โ”œโ”€โ”€ shared/             # Agent Workspace & Global ENVIRONMENT.md
โ”‚   โ”œโ”€โ”€ go2_edu_001/        # Robot-local ACTION.md / EMBODIED.md
โ”‚   โ””โ”€โ”€ ...
โ”œโ”€โ”€ docs/                   # Project Documentation
โ”‚   โ”œโ”€โ”€ PLAN.md             # Detailed Implementation Plan
โ”‚   โ””โ”€โ”€ PROJ.md             # Project Whitepaper & Architecture
โ”œโ”€โ”€ README.md               # English Documentation
โ””โ”€โ”€ README_zh.md            # Chinese Documentation

๐Ÿ—บ๏ธ Roadmap

  • Phase 1: Desktop Loop & Markdown Protocol Establishment.
    • v0.0.1: Framework Design & Initialization
    • v0.0.2: Embodied Skill Plugin Deployment & Invocation Design
    • v0.0.3: Visual Decoupling + Grasping Pipeline (SAM3 & ReKep)
    • v0.0.4: Atomic Action-based VLN Pipeline (SAM3)
    • v0.0.5: Multi-Agent Protocol Design
    • v0.0.6: Long-horizon Task Decomposition, Orchestration & Execution
    • v0.0.7: IoT Device Integration (e.g., XiaoZhi)
  • Phase 2: Multi-Embodiment Coordination & Multi-modal Memory.
  • Phase 3: Constraint Solving & High-level Heterogeneous Coordination.

๐Ÿ› ๏ธ Supported Devices

PhyAgentOS supports various embodiment types through the HAL (Hardware Abstraction Layer) protocol.

Embodiment Type Robot Status Remarks
Desktop Robot Arm AgileX PIPER ๐ŸŸข Verified Full-chain verified with ReKep & SAM3
Composite Robot AgileX PIPER + Unitree Go2 ๐ŸŸก Partial locomotion adaptation in progress
Desktop Robot Arm Dobot Nova 2 ๐ŸŸข Verified ReKep deployment verified
Quadruped Robot Unitree Go2 ๐ŸŸก Partial Currently supports mobility and semantic navigation
Dual-Arm Control XLeRobot ๐ŸŸข Verified Deployment and dual-arm capture have now been achieved
IoT Device XiaoZhi (ESP32) ๐ŸŸก Partial Currently supports voice dialogue interaction
Industrial Robot Franka Research 3 ๐ŸŸข Verified Visual Reasoning and capture have been achieved
Edu Robot Hiwonder Series ๐Ÿ”ด Unsupported Awaiting driver plugin development
General Environment Built-in Simulator ๐ŸŸข Verified Lightweight simulation based on disk mapping

Note: PhyAgentOS is designed with a plugin architecture. Any hardware that supports a Python control interface can be quickly integrated via hal/drivers/. A community plugin template is available at docs/user_development_guide/PLUGIN_DEVELOPMENT_GUIDE.md, with the Chinese version at docs/user_development_guide/PLUGIN_DEVELOPMENT_GUIDE_zh.md.

๐Ÿค Contribute

PRs and Issues are welcome! Please refer to docs/user_development_guide/README.md for detailed architecture design and development guidelines.


Special Thanks: This project is developed based on nanobot, thanks for providing the lightweight Agent framework. Everyone is welcome to go to the nanobot repository and give it a star!

Affiliations

SYSU HCP HCP

We welcome any individual or team to join as a contributor๏ผ

About

PhyAgentOS is a self-evolving embodied AI operating system built on agentic workflows.

Resources

License

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 98.0%
  • TypeScript 1.1%
  • Other 0.9%