Skip to main content

11 docs tagged with "observability"

View all tags

ADR-021: Delegated-Agent Audit Log Storage

Pick a persistent store for delegated_agent audit log entries. Recommend Grafana Cloud Loki (already adopted by ADR-019, free at pilot scale) for general-purpose audit retention, with a small Mongo audit_logs collection reserved only for the customer-facing 'who did what' query surface once a tenant asks for one.

Automated connector pipeline — current-state audit + gap analysis

Current-state audit of sv0-platform's connector → ingest → chain-assemble → evaluate pipeline. Confirms Stream-1 Phases 1-3 already shipped (scheduler, atomic scope claim, execute_scan worker, connector-driver seam, sync→eval→evidence cascade). Identifies the single hard blocker (inert credential broker — `InProcessSubprocessDriver` constructed at `src/index.ts:98` with `env: undefined`) and ten secondary gaps. Drives the decision in ADR-027; lays out the seven-slice migration.

Deployment and Cloud Strategy Research

Deployment strategy options for SecurityV0 from MVP to production scale, including container orchestration paths, observability/logging tradeoffs, CLI operability, and CI/CD automation

Environments & Ops Links

One-page operations index — every SecurityV0 environment and the links to its dashboards, logs, uptime monitors, and infra consoles.

Execution Evidence Fidelity

Honest per-connector accounting of how SecurityV0 derives execution counts, where those numbers match ground truth, and where they don't.

Observability Stack Research — Pre-MediaPro Pilot

Observability stack selection for sv0-platform: evaluates 10 options (Grafana Cloud, self-hosted LGTM, Datadog, BetterStack, Axiom, SigNoz, New Relic, Honeycomb, Cloudflare, stitched-minimum) against cost, agentic access (MCP availability), portability across Hetzner/Azure-VM/AWS-VM, and free-tier allowances. Pick: Grafana Cloud free + BetterStack free + grafana/mcp-grafana.

Operational Resiliency Plan

Operational resiliency plan — error capture, Grafana Cloud log aggregation, external probing, Slack alerting. Middle ground: shared visibility without overengineering.