Vision vs Delivered - Automation-First Realignment Plan (Codex Version)
Date: 2026-02-11
Status: Draft - separate comparison version
1. Executive Assessment
The platform has strong foundations (ingestion, evaluator, evidence packs, graph, temporal endpoints), but delivery is still mostly identity/findings-centric rather than automation execution-centric.
The biggest mismatch is not "missing infrastructure"; it is missing product semantics:
- Automation is not treated as the primary unit of investigation in UI and evaluator outputs.
- Event ordering is weakly aligned to source chronology.
- Ingestion is still connector-normalized batch processing, not a full platform-side correlation model.
- Business-flow continuity is not modeled separately from technical identity instances.
2. Vision-to-Delivery Comparison
2.1 What aligns well
- Deterministic, evidence-oriented architecture exists.
- Cross-system relationships and execution paths are materialized.
- Findings and evidence packs are generated and versioned.
- Graph focus mode and subgraph APIs exist.
2.2 Where current delivery diverges
- MVP1 progress tracker still shows open requirements for core per-automation outputs (inventory, execution detection, egress, origin, risk grouping, evidence pack assembly).
- Connector outputs rich automation fields, but platform evaluator/UI do not systematically consume those fields.
- Diff events are timestamped using ingestion time, causing potential chronology distortion.
temporalMarkersare accepted at ingest but not used for ordering or visualization.- Execution-flow traversal is not strict to canonical flow semantics and remains partially generic.
- Graph UI has stale relationship filters/styles that drift from current domain relationship set.
- Timeline UI contract appears misaligned with backend event shape (
details/sync_versionvschange_details/sync_id).
3. Must-Change Decisions
3.1 Product model: automation-first
Make automation flow the primary operator object:
- Add a dedicated automation inventory API and page.
- Provide per-automation state cards for: execution proof, identity binding, origin, egress, ownership, risk group.
- Keep findings as consequences, not primary navigation.
3.2 Temporal correctness
Adopt explicit timestamp semantics:
effective_at(source event time)observed_at(connector fetch time)ingested_at(platform persist time)
Timeline, ordering, and replay should default to effective_at.
3.3 Canonical execution graph semantics
Enforce flow templates in graph APIs:
- Forward mode:
origin -> trigger -> automation -> auth chain -> role/permission -> resource -> egress - Reverse mode:
egress/resource -> ... -> automation -> origin
Avoid mixing causal flow edges and generic neighborhood expansion in execution mode.
3.4 Ingestion and correlation redesign
Move from "connector submits final normalized chain" to staged platform pipeline:
- Fact import (all entities/edges/events)
- Correlation/identity resolution against existing graph
- Connection updates + low-risk deletion handling
- Flow projection/materialization
- Evaluation and evidence generation
4. UI Realignment Plan
4.1 Core UX objective
Users should answer this in under one minute:
"What automations are running, what data do they touch, where does data leave, who is accountable, and what changed recently?"
4.2 Display options for automation flow
Option A (recommended): Flow lanes
Fixed columns:
- Origin
- Trigger
- Automation
- Run-As/Auth
- Authority
- Resource
- Egress
Pros: highest clarity and strongest PRD alignment.
Option B: Split graph + event sequence
- Left: canonical chain
- Right: ordered event sequence by
effective_at
Pros: best for investigations with change context.
Option C: Query-based path explorer
Path search from source/egress constraints, returns ranked deterministic chains.
Pros: powerful for advanced operators; steeper UX.
4.3 Recommended UI sequence
- Automation Overview (new default landing)
- Automation Detail (chain + evidence + state)
- Graph Explorer (deep dive mode only)
- Findings (operational queue after automation triage)
5. Business vs Technical Identity Continuity
Problem: technical identifier changes (for example, client ID rotation) can look like a new execution chain, while business intent is unchanged.
Solution: two-level model.
automation_instancefor technical runtime identities and credentials.business_flowfor stable business process identity.
Linking strategy:
INSTANCE_OFfrom automation instance to business flow.SUPERSEDESbetween technical instances across rotations.- Deterministic business-flow fingerprint from trigger, target domain/system, operation type, and accountable function.
Outcome: technical churn is preserved without losing business continuity.
6. Ingestion Architecture Upgrade (Target State)
6.1 Pipeline stages
- Import: ingest source facts without over-collapsing.
- Resolve: correlate with existing entities via deterministic identity keys + alias map.
- Reconcile: update/new edges and controlled deletions.
- Project: recompute execution and automation flow views.
- Evaluate: run rule engine at flow and instance level.
- Publish: update UI projections and evidence references.
6.2 Continuous update model
- Use persistent sync cursors and retention-safe refresh.
- Support low-rate incremental sync plus webhook-driven updates.
- Recompute only impacted flow neighborhoods, not full tenant graph.
7. Delivery Plan
Phase 0 - Contract lock (1 week)
- Freeze canonical execution flow semantics.
- Align PRD/walkthrough/progress language with actual behavior.
- Define API contracts for automation-first pages.
Phase 1 - Data model and ingestion core (2 weeks)
- Add
business_flowand mapping tables. - Add durable ingest idempotency ledger.
- Activate cursor-based incremental processing in platform pipeline.
Phase 2 - Temporal and event correctness (1 week)
- Add multi-timestamp event model.
- Update timeline APIs and UI adapters.
- Consume
temporalMarkersin ordering logic.
Phase 3 - Evaluator and evidence alignment (2 weeks)
- Make MVP1 automation fields first-class evaluator inputs.
- Add automation/flow-level findings where needed.
- Extend evidence packs with continuity and flow-chain sections.
Phase 4 - UI automation-first rollout (2 weeks)
- Build Automation Overview and Automation Detail.
- Implement flow-lane visualization + reverse traversal view.
- Keep graph explorer as investigation depth tool.
Phase 5 - Validation and rollout hardening (1 week)
- Backfill/migration for existing entities.
- Regression suite for ordering, continuity, and traversal semantics.
- Pilot validation with deterministic replay checks.
8. Success Criteria
- Analysts can trace any automation from origin to egress (and reverse) in three clicks or fewer.
- Timeline ordering is source-accurate and reproducible.
- Client ID/credential rotations preserve business-flow continuity.
- MVP1 per-automation outputs are visible and actionable in UI without custom filtering.
- Continuous low-rate update mode maintains graph correctness without full-sync dependence.
9. Implementation Notes
- This document is intentionally separate from
plans/2026-02-10-vision-vs-delivered-analysis.mdfor side-by-side comparison. - It prioritizes architecture and UX semantics over connector feature inventory.
- It is intended to drive engineering tickets across
sv0-platform,sv0-connectors, and docs.