Senior AWS-Certified Cloud Technical Support & Site Reliability Engineer with 9+ years building and defending mission-critical airline platforms. I make systems reliable, incidents shorter, and operations smarter with AI.
I've spent 9+ years in airline IT and cloud operations, working at the intersection of reliability, automation, and AI. From leading 24/7 global incident management for Singapore Airlines to building LLM-powered intelligent operations, I focus on one thing: making sure the systems people rely on actually work.
What drives me is solving high-impact problems under pressure. I'm now looking to bring that mindset to a team that values problem-solving, reliability, and operational ownership on a global scale. Currently pursuing my Generative AI Developer Professional certification.
Enterprise monitoring across 100+ apps with predictive alerting using Splunk, CloudWatch, Datadog, Dynatrace.
Serverless retry system with exponential backoff ensuring booking confirmations reach customers reliably.
Pattern recognition engine routing unbilled bookings to payment gateways. Recovers $1M+ annually.
AES-256-CBC encrypted secrets manager with RBAC and audit logging. Containerized on EKS via ECR pipeline.
Integrated GPT/Claude with enterprise operations for incident analysis, auto-documentation, and predictive resolution.
This interactive site with morphing blobs, command palette, and boot terminal. Hosted on AWS S3 + CloudFront.
Amazon Web Services
Valid till 2027
Amazon Web Services
Valid till 2027
The Linux Foundation
Associate Level
HashiCorp
Associate Level
Scrum.org
Agile Leadership
Splunk Inc.
Data Analytics
Currently Studying
In ProgressRecognized for outstanding leadership in managing global teams and delivering exceptional system reliability across time zones.
Pioneered LLM integration with enterprise operations, setting new standards for intelligent incident management and predictive analytics.
Achieved $1M annual revenue recovery through missed transaction capture and automated payment processing optimizations.
Maintained 99.99% SLO while reducing P1 incidents by 83%, demonstrating exceptional operational commitment.
Consistently closed 500+ tickets monthly within SLA, reducing MTTR by 65% through process optimization.
Reduced complaints by 30% through email automation and improved payment processing reliability.