infra tooling¶
Purpose¶
Own the developer operating layer: cbox lifecycle and safety, local/CI dev ergonomics, command surface cleanup, and deployment pathways that make Dataface reliably build, review, and ship. This workstream focuses on operational quality of the toolchain itself rather than product features.
Owner¶
- Sr Engineer Architect
Tasks by Milestone¶
A runnable prototype path exists for developer tooling, cbox runtime reliability, and deployment execution safety, with concrete artifacts that prove the flow works end-to-end in the current codebase. Core assumptions are documented, known constraints are explicit, and the team can explain what is real versus mocked without ambiguity.
- No tasks tracked for this milestone yet.
Internal analysts and engineers can execute at least one weekly real workflow that depends on developer tooling, cbox runtime reliability, and deployment execution safety in the 5T Analytics environment, without bespoke engineering intervention for every run. Instrumentation and feedback capture are in place so failures, friction points, and adoption gaps are visible and triaged with owners.
- Harden cbox sandbox bootstrap with PATH, pre-commit, and git auth health checks — Eliminate recurring sandbox runtime failures by standardizing environment bootstrap and adding startup health checks fo…
- Host Dataface on Fivetran GCP — Stand up and harden the canonical GCP runtime path for internal pilot usage, including deploy auth hardening and intern…
- Add full-name output mode for cbox list — Prevent manager confusion from truncated session names by supporting a full-width/raw list mode.
- Add per-worktree local port bundles for dispatch QA — Allocate unique local serve ports for each dispatched worktree, write a worktree-local ports file similar to .cbox-port…
- Align cbox bootstrap/health docs with actual commands — Fix canonical and wrapper docs where bootstrap and git auth health-check commands diverge from implementation.
- cbox manager default parent-branch policy — Make manager-launched sandboxes default to the manager's active branch (not main) unless explicitly overridden with --p…
- CBox manager interactive stall detection and recovery — Detect and recover manager flows stuck at interactive prompts (e.g. /pr-lite menus, blocker prompts) with deterministic…
- CBox Process: diagnose and hard-fail silent cbox review failures in sandboxes — Observed in M1-INFRA-027 forensic run: sandbox had CBOX_CONTAINER=1 so cbox review should use _run_review_in_tmux, but…
- CBox Process: hard-block PR when cbox review runtime is missing — Observed during M1-INFRA-027: sandbox /pr flow offered 'skip review, open PR' when Docker/Podman missing, leading to PR…
- CBox review prompt context isolation on sandbox restart — Observed restart path where sandbox opened with stale review prompt context (.cbox/.review-prompt.md flow). Ensure sand…
- CBox sandbox bootstrap health parity for python and pre-commit — Repeatedly observed on fresh sandbox start: bootstrap health checks fail for python and pre-commit immediately after se…
- CBox sandbox git metadata path isolation — Fix sandbox git commands failing because worktree metadata points to host paths (e.g., '/Users/.../.git/packed-refs').…
- CBox sandbox session liveness drop detection and recovery — Observed manager incident: sandbox session disappeared ('No session found') while worktree/branch remained intact. Add…
- cbox sandbox sessions can exit unexpectedly during long task handoff — Track issue 423 in master_plans after retiring GitHub Issues as the active backlog.
- CBox sandbox startup-timeout diagnostics — Surface actionable diagnostics when wait_for_prompt times out during sandbox or review startup, replacing opaque "Timeo…
- cbox send false-positive delivery when sandbox TUI ignores input — Investigate and fix cases where cbox send reports success after tmux send-keys, but the target sandbox Claude TUI does…
- CBox session registry stale after sandbox kill — Observed during manager cleanup: 'cbox new --list' continued showing a killed sandbox while 'cbox list' showed no sessi…
- CBox setup-worktree ROOT_WORKTREE_PATH fallback — Harden worktree setup so cp from root .env succeeds when ROOT_WORKTREE_PATH is unset by deriving the root path from git…
- Configurable review timeouts and stall detection — Add configurable review timeout (CLI flag + env var) with 20m default, and stall detection that distinguishes slow-but-…
- Decouple task workflow from cbox CLI — add /cbox-task command surface — Keep core cbox generic and make task workflow optional via a composable skill/command layer.
- Improve cbox recovery from hung in-session tool calls — Ensure manager interrupt/send can reliably recover sandboxes stuck in long-running shell tool calls.
- Make cbox entrypoint bootstrap timeout configurable — Replace hard-coded bootstrap timeout values with environment-configurable settings and sane defaults.
- Make qa-explorer use local browser subagent without cbox fallback — Make qa-explorer run through the local subagent/browser path instead of any cbox fallback, ensure the browser automatio…
- Master Plans CLI ergonomics and command wrappers — Make master_plans task tooling easier to run than raw python invocation by adding a user-friendly command entrypoint, c…
- Prevent cbox sandboxes from mutating host git common-dir — Sandbox containers currently mount the host repo common .git directory writable at /workspace/.repo-git. Diagnose and f…
- Reduce cbox sandbox startup latency by parallelizing health checks — Run post-boot sandbox health checks concurrently instead of sequentially to reduce worst-case startup delay.
- Scope just server bindings by execution context — Make localhost the default host binding for local recipes and keep 0.0.0.0 where container access requires it.
- Simplify PR checklist enforcement and reduce brittle PR body sync — Reduce false-negative PR checklist failures by removing redundant gates, consolidating label definitions, and making en…
- Add cbox test command for running visual tests locally in Linux container — Add a repeatable
cboxcommand that runs visual tests locally in the Linux container used by CI. - Consolidate local dispatch and review scripts behind shared implementation — Refactor the new local worktree dispatch and review tooling so scripts/dispatch, scripts/dispatch-kill, scripts/review,…
- Improve merge flow guidance for worktree-bound local branches — Document and handle expected local branch deletion warnings after
gh pr merge --delete-branchwhen branch is checked… - Smarter cbox cleanup: detect squash-merged PRs, ignore sandbox artifacts, scan all worktrees — Make cbox cleanup catch squash-merged branches, ignore .claude-sessions-sandbox/ as dirty state, check commits-ahead, a…
developer tooling, cbox runtime reliability, and deployment execution safety is hardened enough for regular use by multiple internal teams and initial design partners, with a predictable response loop for issues and requests. Quality expectations are documented, and prioritized improvements from real usage are actively incorporated into delivery.
- Master Plans CLI next-stage guidance command — Add an advisory
plans task checkcommand that inspects a task's narrative sections, reports which are incomplete, ide…
Launch scope for developer tooling, cbox runtime reliability, and deployment execution safety is complete, externally explainable, and supportable: user-facing behavior is stable, documentation is publishable, and operational ownership is explicit. Remaining gaps are non-blocking, risk-assessed, and tracked as post-launch follow-up rather than unresolved launch debt.
- No tasks tracked for this milestone yet.
Post-launch stabilization is complete for developer tooling, cbox runtime reliability, and deployment execution safety: recurring incidents are reduced, support burden is lower, and quality gates are enforced consistently before release. The team has a repeatable operating model for maintenance, regression prevention, and measured reliability improvements.
- No tasks tracked for this milestone yet.
v1.2 delivers meaningful depth improvements in developer tooling, cbox runtime reliability, and deployment execution safety based on observed usage and retention signals, not just roadmap intent. Enhancements improve real customer outcomes, and release readiness is demonstrated through metrics, regression coverage, and clear migration guidance where relevant.
- No tasks tracked for this milestone yet.
Long-horizon opportunities for developer tooling, cbox runtime reliability, and deployment execution safety are captured as concrete hypotheses with user impact, prerequisites, and evaluation criteria. Ideas are ranked by strategic value and feasibility so future investment decisions can be made quickly with less rediscovery.
- No tasks tracked for this milestone yet.