Skip to content

OpenAI Codex Computer Use, In-App Browser, and Visual QA

OpenAI Codex Computer Use, In-App Browser, and Visual QA

Section titled “OpenAI Codex Computer Use, In-App Browser, and Visual QA”

Many coding tasks cannot be verified by reading files or running unit tests alone. A button may be misaligned, a modal may trap focus, a chart may overflow, or a desktop app bug may only appear when a human clicks through the interface. Codex desktop adds visual workflow options that help close this gap: in-app browser workflows, screenshots, image generation in some workflows, and computer use for operating graphical interfaces.

The important rule: use visual tools to create evidence, not vibes.

Use the in-app browser first for web apps you are building locally. Use computer use when Codex needs to operate a desktop app or a graphical workflow that cannot be reached through files, commands, or structured integrations. Require evidence after every visual task: screenshots, reproduction steps, viewport details, changed files, and what was verified.

TaskPreferWhy
Local web app UI checkIn-app browserBuilt for web workflows and comments on pages
Frontend screenshot comparisonIn-app browser or browser automation skillEasier to reproduce and capture evidence
Desktop app settingsComputer useRequires GUI interaction outside web app
iOS simulator or native app QAComputer use or dedicated simulator toolingVisual state matters
Data in a GUI-only appComputer useNo structured connector available
Gmail, Drive, Slack, GitHub dataPlugin or connector firstStructured access is safer and repeatable
Browser workflow with credentialsStructured integration first, computer/browser only if necessaryVisible content and credentials create risk

Computer use is powerful because it can operate graphical interfaces. It is also high-context and higher-risk because it may see screen content, screenshots, app windows, and clipboard state.

Open the local preview in the in-app browser and inspect the pricing page
at desktop and mobile widths.
Check:
- hero layout;
- CTA visibility;
- horizontal overflow;
- font size and line height;
- card spacing;
- footer readability.
If you find an issue, make the smallest CSS or component change.
After each change, re-check the same viewport.
Return:
- viewport sizes tested;
- screenshots or visual observations;
- files changed;
- whether there is remaining risk.

This prompt makes the visual task bounded. It does not ask Codex to “make it beautiful” without constraints.

Use visual QA for:

  • responsive layout changes;
  • landing pages;
  • dashboard changes;
  • interactive forms;
  • screenshot-to-code work;
  • game or canvas behavior;
  • accessibility-affecting UI flows;
  • PDF, slide, spreadsheet, or document rendering checks;
  • regressions where the failing behavior is visible but not captured by tests.

Do not use visual QA as a substitute for:

  • unit tests;
  • type checks;
  • linting;
  • deterministic accessibility checks;
  • API contract tests;
  • snapshot or screenshot tests where stable automation already exists.

Visual review and deterministic checks should reinforce each other.

Before asking Codex to use computer use:

  1. Name the exact app.
  2. Name the exact flow.
  3. State what Codex may click or type.
  4. Keep secrets out of visible areas when possible.
  5. Avoid finance, identity, production admin, or irreversible workflows.
  6. Stay present for sensitive flows.
  7. Prefer plugins or MCP servers when structured access exists.

Example:

Use computer use to open the local desktop app only.
Do not open browser password managers, email, or production admin pages.
Reproduce the onboarding bug by clicking New Project, selecting Blank,
and pressing Continue. If the app crashes or blocks, stop and report
the visible state before editing files.

A Codex visual QA result should include:

  • what was opened;
  • viewport or app window state;
  • steps taken;
  • issue observed;
  • files changed;
  • before and after observation;
  • commands run;
  • residual risks;
  • any permission prompt encountered.

Weak result:

Looks good now.

Strong result:

Tested /pricing/ at 1440x900 and 390x844 in the in-app browser. The mobile CTA wrapped below the hero image and caused horizontal overflow. Updated PricingHero.astro to change the grid breakpoint and constrain the image width. Rechecked both viewports. No horizontal scroll observed. Did not test Safari.

The second result is useful because it tells a reviewer what to trust and what not to trust.

For content sites and tool pages, Codex visual QA can support:

  • screenshot-based tutorials;
  • UI comparison articles;
  • “before and after” implementation guides;
  • responsive design repair playbooks;
  • Markdown-to-image workflows;
  • prompt-to-landing-page examples;
  • app setup guides with visual steps.

Those pages can attract qualified readers because they solve real problems that ordinary text-only articles do not solve. But the visuals must be useful evidence, not decoration.

This page is based on OpenAI’s Codex computer use documentation, Codex in-app browser documentation, and Codex for almost everything.