Base 42 — Run Together

Launch day is just Day 1. Base 42 is our managed-ops layer that keeps your product fast, secure, and always shipping. It blends light Site-Reliability Engineering (SRE) with product-ops so adoption and uptime rise in tandem.

1 · Why Base 42?

Production PainHow Base 42 Solves It
“Who's on pager duty?”24 × 7 on-call rotation, first-line triage in < 15 min.
Bug found → no one can roll backGit tag + blue/green release flow with instant revert.
Metrics in ten dashboards, none linked to featuresUnified Grafana board tying KPIs, errors, logs, and cost to each epic.
Roadmap stalls under unplanned maintenanceMonthly optimisation sprint keeps feature velocity ≥ 80 % of dev budget.

2 · Managed-Hosting Options

TierStackIdeal ForSLA¹Typical Cost²
BYOC (Bring your own cloud)AWS, Azure, GCP, on-prem Docker/K8sEnterprises with internal DevOpsAdvisory only5–10 % of dev spend
Edge Cloud (default)Fly.io + PostgresSaaS & fintech MVPs needing global latency99.9 %12–16 %
HybridFly.io front, AWS data plane, Salesforce or SAP integrationsRegulated workloads99.95 % core, 99.9 % edge15–20 %
Full-ServiceEdge Cloud + CI/CD + DaaS squadScale-ups wanting "no-ops"99.95 % + hot-fix < 1 h18–22 %

¹ Monthly uptime.

² Percentage of monthly engineering run-rate (DaaS or Pod).

All tiers include Datadog, Sentry, Statuspage and automated SSL rotation.

3 · Ops Lifecycle

PhaseWhat HappensArtefacts
Product-Ops Sprint (monthly)Prioritise bugs, A/B tests, infra choresRanked "Run Backlog"
Blue/Green ReleaseTraffic shift with auto-rollback if health < 99 %Release notes
Telemetry & AlertsDatadog SLOs, Sentry error triageReal-time dashboard
Incident ResponsePagerDuty rotation, Slack war-roomRCA doc in 24 h
Post-Mortem & Tech-Debt TicketBlameless review, ticket into Forge backlogJira ticket linked to RCA

4 · Reliability & Performance Budgets

Error Budget

p95 latency target (default 300 ms API, 3 s PWA).

Change Failure Rate

goal ≤ 5 %.

Mean Time to Restore (MTTR)

P1 ≤ 2 h, P2 ≤ 8 h.

Traffic Surge Policy

auto scale 3× in < 60 s on Fly.io; static warm pool for AWS.

5 · Security & Compliance

ControlImplementation
Secrets ManagementDoppler or AWS Secrets Manager — no keys in env files.
Vulnerability ScansNightly Snyk + GitHub Dependabot blobs auto-PR.
Data EncryptionTLS 1.3 in transit; AES-256 at rest.
Audit LogImmutable CloudTrail / Fly Log-Drains; 30-day hot, 1-year cold.
Compliance SupportSOC-2 / ISO 27001 questionnaire pack; HIPAA BAA addendum.

6 · Optimisation & Cost Control

Usage-to-Cost Dashboard

unit cost per 1 k requests, per tenant.

Weekly Anomaly Alerts

spend spike > 15 % triggers Slack ping.

Quarterly Infra Review

right-size instances, prune dead feature flags.

AI-assisted Index Tuning

pgvector and Postgres auto-suggested indexes applied after tests.

Scale-ups on Full-Service tier cut infra cost/MAU by avg 22 % in year 1.

7 · Exit & Portability

  • 14-day data export + Terraform scripts.
  • Handoff meeting, doc stack, and credentials.
  • Optional "Shadow Month" at 50 % fee for knowledge overlap.
  • No vendor lock-in: you own cloud accounts (Edge Cloud uses your Fly.io org).

8 · Mini Case

Retail Flash-Sale Platform — Black-Friday traffic spiked 18×; autoscale kept p95 < 280 ms, zero downtime. Infra spend only +31 % vs baseline due to right-sizing.

9 · FAQs

10 · Call to Action

Want 99.9 % uptime without hiring an SRE team?

Book a Base 42 readiness call—get a tailored hosting plan and cost in 48 h.

Secure My Ops