Other languages: 中文
- This project is open source and intended for learning and communication only. Closed-source commercial use is prohibited.
- Users must comply with local laws and regulations. Any illegal use is prohibited.
- The developer is not liable for any direct or indirect loss caused by using this project.
- Using this project means you have read and agreed to this disclaimer.
chromemanager-skill connects ChromeManager to external AI agents (such as OpenClaw and Hermes), so agents can call local APIs for window management and run webpage automation actions through debugPort.
This repository includes two standalone folders:
chromemanager-skill-openclawchromemanager-skill-hermes
Both skills provide the same core capabilities. The main differences are integration and trigger behavior.
- Use ChromeManager to batch open/import/select/navigate/close windows
- Execute webpage actions via
debugPort:click/fill/press/extract/screenshot - Support multi-window task orchestration and concurrency control
- Support multi-step
run-planworkflows with automatic progress reporting
Shared base actions:
prepare: open and prepare target windowssnapshot: capture page structure and refsclick: click a page elementfill: fill text into an inputpress: press a keyboard keyextract: extract text from the pagescreenshot: capture page screenshotsdesktop-screenshot: capture the full desktopreport-progress: report status with optional screenshotvalidate-stage: verify page staterecover: reload or re-navigate failed windowsrun-plan: run a multi-step workflow
For both OpenClaw and Hermes editions:
- Place the target skill folder under the agent
skillsdirectory - Enter that skill directory
- Run:
npm install- Edit
config.jsonand set:api_token: API token generated in ChromeManagerapi_host: API host address (usually127.0.0.1)api_port: API port (default18923)software_path: local install path of ChromeManager executable (Windows example:D:/***/***/ChromeManager.exe; macOS example:/Applications/***/ChromeManager.app)autoStart: whether to auto-launch ChromeManager when not running (true/false)
- Enable local API in ChromeManager and make sure the token matches
api_token
- Windows (local OpenClaw):
C:\Users\<your-username>\.openclaw\workspace\skills\
- Windows (OpenClaw running in WSL):
~/.openclaw/workspace/skills/
- macOS:
~/.openclaw/workspace/skills/
- Trigger style: supports
/cmand/chromemanager
- Windows (local Hermes):
C:\Users\<your-username>\.hermes\skills\- or Hermes external skills scan directory
- Windows (Hermes running in WSL):
~/.hermes/skills/- or Hermes external skills scan directory
- macOS:
~/.hermes/skills/- or Hermes external skills scan directory
- Trigger style: natural-language trigger (via memory keywords) is recommended
You can ask for concise usage help directly:
/cm help
/cm 帮助
Important
For first-time setup, configure triggers first:
- OpenClaw:
- Read and load
chromemanager-skill-openclaw/SKILL.md, then bind/cmand/chromemanagertochromemanager-skill-openclawas triggers
- Read and load
- Hermes:
- Read and load
chromemanager-skill-hermes/SKILL.md, then savecm,ChromeManager, andchromemanageras memory triggers, and route them tochromemanager-skill-hermeswith priority
- Read and load
Single-line task example:
/cm prepare 1-5, open https://example.com, click "Check in", then close all pages
You can also split into steps:
/cm prepare 1-5
/cm open https://example.com
/cm click "Check in"
/cm close all pages
If users need to control concurrency, they can describe it directly in natural language:
- "run in batches of 2 windows"
- "run in batches of 3 windows"
- "run all in parallel"
- "run serially"
Recommendations:
- Increase concurrency for "open page" and "close page"
- Keep "click" and "input" more conservative to reduce page-state divergence
Use run-plan for longer workflows, especially when you need:
- multi-step chains (for example: prepare -> navigate -> fill -> submit -> validate)
- step-to-step result reuse
- conditional branches, failure policies, and concurrency control
Default behavior:
run-planappends an automaticreport-progressat the endautoReportOnPlanEnddefault:trueautoReportCapturedefault:desktop(setnoneto disable screenshots)autoReportStrictdefault:false(auto-report failure does not fail the whole plan)autoReportFailedQuestionlets you customize the follow-up question when blocked
Recommended practice:
- send one
report-progressafter each key stage - when validation/recovery fails or wallet confirmation is blocked, screenshot first and report
- include: current stage, affected windows, result status, and next-step question