Dynascale Inc.
03/2024 – PresentSenior Site Reliability Engineer · Incident Commander & Responder
- ▹Architect and operate highly available cloud platforms across AWS, Azure, and GCP supporting multiple client production environments.
- ▹Serve as senior Incident Commander for customer and platform incidents — triage, mitigation, escalation, and post-incident remediation.
- ▹Improved observability through custom alerting pipelines and real-time telemetry integration.
- ▹Reduced cloud spend through reserved instances, autoscaling optimization, and rightsizing.
- ▹Automated infrastructure lifecycle with Terraform, CloudFormation, and Ansible — cutting deployment lead time and operational risk.
- ▹Reduced manual operational intervention by 30% through automation, self-healing workflows, and standardization.
- ▹Lead disaster recovery strategy: backup validation, failover testing, and incident response playbooks.
- ▹Developing agentic AI automation pipelines for system administration and self-healing remediation across Hyper-V, AWS, and Azure.
- ▹Mentor engineers on reliability engineering, automation practices, and production ownership.