OpenAI Codex Computer Use, In-App Browser, and Visual QA
OpenAI Codex Computer Use, In-App Browser, and Visual QA
Section titled “OpenAI Codex Computer Use, In-App Browser, and Visual QA”Many coding tasks cannot be verified by reading files or running unit tests alone. A button may be misaligned, a modal may trap focus, a chart may overflow, or a desktop app bug may only appear when a human clicks through the interface. Codex desktop adds visual workflow options that help close this gap: in-app browser workflows, screenshots, image generation in some workflows, and computer use for operating graphical interfaces.
The important rule: use visual tools to create evidence, not vibes.
Quick answer
Section titled “Quick answer”Use the in-app browser first for web apps you are building locally. Use computer use when Codex needs to operate a desktop app or a graphical workflow that cannot be reached through files, commands, or structured integrations. Require evidence after every visual task: screenshots, reproduction steps, viewport details, changed files, and what was verified.
In-app browser vs computer use
Section titled “In-app browser vs computer use”| Task | Prefer | Why |
|---|---|---|
| Local web app UI check | In-app browser | Built for web workflows and comments on pages |
| Frontend screenshot comparison | In-app browser or browser automation skill | Easier to reproduce and capture evidence |
| Desktop app settings | Computer use | Requires GUI interaction outside web app |
| iOS simulator or native app QA | Computer use or dedicated simulator tooling | Visual state matters |
| Data in a GUI-only app | Computer use | No structured connector available |
| Gmail, Drive, Slack, GitHub data | Plugin or connector first | Structured access is safer and repeatable |
| Browser workflow with credentials | Structured integration first, computer/browser only if necessary | Visible content and credentials create risk |
Computer use is powerful because it can operate graphical interfaces. It is also high-context and higher-risk because it may see screen content, screenshots, app windows, and clipboard state.
Visual QA prompt pattern
Section titled “Visual QA prompt pattern”Open the local preview in the in-app browser and inspect the pricing pageat desktop and mobile widths.
Check:- hero layout;- CTA visibility;- horizontal overflow;- font size and line height;- card spacing;- footer readability.
If you find an issue, make the smallest CSS or component change.After each change, re-check the same viewport.
Return:- viewport sizes tested;- screenshots or visual observations;- files changed;- whether there is remaining risk.This prompt makes the visual task bounded. It does not ask Codex to “make it beautiful” without constraints.
When visual QA is worth the cost
Section titled “When visual QA is worth the cost”Use visual QA for:
- responsive layout changes;
- landing pages;
- dashboard changes;
- interactive forms;
- screenshot-to-code work;
- game or canvas behavior;
- accessibility-affecting UI flows;
- PDF, slide, spreadsheet, or document rendering checks;
- regressions where the failing behavior is visible but not captured by tests.
Do not use visual QA as a substitute for:
- unit tests;
- type checks;
- linting;
- deterministic accessibility checks;
- API contract tests;
- snapshot or screenshot tests where stable automation already exists.
Visual review and deterministic checks should reinforce each other.
Computer use safety rules
Section titled “Computer use safety rules”Before asking Codex to use computer use:
- Name the exact app.
- Name the exact flow.
- State what Codex may click or type.
- Keep secrets out of visible areas when possible.
- Avoid finance, identity, production admin, or irreversible workflows.
- Stay present for sensitive flows.
- Prefer plugins or MCP servers when structured access exists.
Example:
Use computer use to open the local desktop app only.Do not open browser password managers, email, or production admin pages.Reproduce the onboarding bug by clicking New Project, selecting Blank,and pressing Continue. If the app crashes or blocks, stop and reportthe visible state before editing files.Evidence packet
Section titled “Evidence packet”A Codex visual QA result should include:
- what was opened;
- viewport or app window state;
- steps taken;
- issue observed;
- files changed;
- before and after observation;
- commands run;
- residual risks;
- any permission prompt encountered.
Weak result:
Looks good now.
Strong result:
Tested
/pricing/at 1440x900 and 390x844 in the in-app browser. The mobile CTA wrapped below the hero image and caused horizontal overflow. UpdatedPricingHero.astroto change the grid breakpoint and constrain the image width. Rechecked both viewports. No horizontal scroll observed. Did not test Safari.
The second result is useful because it tells a reviewer what to trust and what not to trust.
Visual tasks that can drive traffic
Section titled “Visual tasks that can drive traffic”For content sites and tool pages, Codex visual QA can support:
- screenshot-based tutorials;
- UI comparison articles;
- “before and after” implementation guides;
- responsive design repair playbooks;
- Markdown-to-image workflows;
- prompt-to-landing-page examples;
- app setup guides with visual steps.
Those pages can attract qualified readers because they solve real problems that ordinary text-only articles do not solve. But the visuals must be useful evidence, not decoration.
Related paths
Section titled “Related paths”Source notes
Section titled “Source notes”This page is based on OpenAI’s Codex computer use documentation, Codex in-app browser documentation, and Codex for almost everything.