I keep critical financial systems running at 99.9%+ uptime — on-call rotations, incident response, and CI/CD automation for Java and legacy Oracle/WebLogic stacks. Background in distributed systems and competitive programming.
PostgreSQL + Node.js/Express + React/nginx on Minikube with reverse proxy and persistent volume claims. Debugged layered routing and container networking.
EC2 + CloudWatch + SNS observability stack — sub-2-minute failure detection, automated alerts, recovery validated via simulated crashes.
End-to-end monitoring for HTTP endpoints and MySQL with custom dashboards for latency, throughput, error rate, and threshold alerting.
Full-stack Java app (Tomcat, MySQL, Nginx) on a self-managed 4-node cluster. Deployments, Services, ConfigMaps, rolling updates, and horizontal scaling.
End-to-end pipeline for a Java app — Git, Maven, and Docker integrated into Jenkins with staged promotion across environments.
Playbooks provisioning multi-tier environments (Tomcat, MySQL, Nginx, RabbitMQ, Memcached) with idempotent, repeatable config across nodes.
RESTful API on EC2 with RDS/PostgreSQL backend. CloudWatch metrics, health checks, and automated restart scripts for resilience under load.
Skyscanner-style REST API in Spring Boot with layered architecture and caching to cut repeated query load and improve response time.