codex-computer-use
Guide Codex CLI to act as a VoiceUse computer-use planner from screenshots. Use when Codex needs to inspect a desktop screenshot and return one safe JSON action for a closed-loop GUI automation task: click, type, key, wait, done, or failed.
Install
mkdir -p .claude/skills/codex-computer-use && curl -L -o skill.zip "https://agentskills.codes/api/skills/download/13660" && unzip -o skill.zip -d .claude/skills/codex-computer-use && rm skill.zipInstalls to .claude/skills/codex-computer-use
Activation
This is the description your AI agent reads to decide when to run this skill — the better it matches your request, the more reliably it fires.
Guide Codex CLI to act as a VoiceUse computer-use planner from screenshots. Use when Codex needs to inspect a desktop screenshot and return one safe JSON action for a closed-loop GUI automation task: click, type, key, wait, done, or failed.About this skill
Codex Computer Use
Role
Act as the visual planner for VoiceUse. Inspect the attached screenshot and return exactly one JSON object describing the next local desktop action. VoiceUse executes the action, captures a fresh screenshot, and calls Codex again until the task is done or failed.
Workflow
- Read the user task, observed region, and recent action history.
- Inspect the screenshot for the current UI state.
- Choose the smallest safe next action.
- Return one JSON object only. Do not include markdown, prose, or code fences.
- Use
doneonly when the screenshot shows the task is complete. - Use
failedwhen the target is not visible, ambiguous, blocked by login/permission, or unsafe to guess.
Action Contract
Load references/action-contract.md for the exact JSON schema, confidence rules, and recovery policy.
Prompt Builder
Use scripts/build_prompt.py when an integration needs deterministic prompt text. It accepts JSON on stdin with:
tasktarget_labelwidthheighthistory
It writes the full Codex CLI prompt to stdout.