About
Reliability is a discipline. Calm is a practice.
I'm Jeremy Martinez — a Senior Site Reliability Engineer and Incident Commander based in Las Vegas. For more than two decades I've designed, operated, and rescued mission-critical platforms across cloud and hybrid environments.
My work sits at the intersection of three disciplines: reliability engineering (how systems should behave when nothing is on fire), incident command (how humans should behave when everything is), and platform automation (so neither happens by accident).
The arc
I started in the US Army as a Communications Center Operator working with cryptographic equipment and classified satellite systems. That foundation — operate the equipment, trust the procedure, communicate under pressure — shaped everything that came after.
From there I built and ran high-traffic Linux server farms, PCI-compliant hosting platforms, and streaming infrastructure. I spent 11 years at eBay as a Production Unix Systems Engineer and Incident Responder — leading triage on global e-commerce outages, holding a 99.997% uptime SLA, and automating away 90% of a team's operational toil. I was recognized with a Critical Talent Bonus for that work.
At Upstart I served as Incident Commander for enterprise production incidents, owned Rootly configuration and escalation workflows, and reported reliability metrics directly to executive leadership. Today at Dynascale I architect HA cloud platforms across AWS, Azure, and GCP, lead disaster recovery strategy, and build agentic AI automation pipelines for self-healing remediation.
Why this site exists
Incident command is a learnable skill, and most engineering organizations are still making it up the first time the pager goes off. I built Incident Commander HQ — a training platform and live simulator — to raise that bar. I speak and consult on the same topics. If your team needs to get good at handling incidents, I can help.