Senior SRE · Incident Commander · Speaker
Calm in the chaos.
Reliability by design.
20+ years architecting, operating, and rescuing mission-critical platforms — from eBay-scale e-commerce to high-growth fintech. I help engineering organizations build the muscle to handle incidents with clarity and grace.

Incident Command
Led IC for global outages at eBay, Upstart, and Dynascale. Built Rootly, PagerDuty, and ServiceNow workflows that cut MTTR and restored executive trust.
Reliability Engineering
99.997% uptime SLA on mission-critical Unix fleets. Eliminated 90% of operational toil through automation, self-healing, and ruthless standardization.
Speaking & Mentoring
Talks on Incident Command, blameless postmortems, and SRE culture — tuned for startup founders through enterprise platform teams.
Featured work
Things I've built
Incident Commander HQ
A training platform and live simulator for production incident command — ICS principles, escalation trees, and blameless postmortem workflows.
incidentcommandhq.com
Hit Director
Managed web hosting and design studio — high-traffic Linux infrastructure built on two decades of hardened ops experience.
hitdirector.com
Bring real incident command experience to your team.
Available for keynotes, workshops, and internal training on Incident Command, Incident Management, and Site Reliability — from early-stage startups to enterprise engineering orgs.
Start the conversation