Senior SRE & Technical Director
Principal Site Reliability Engineer (SRE) & Engineering Manager with 25+ years of hands-on experience designing and operating mission-critical data infrastructure. Deeply specialized in Cloud Architecture (Oracle Cloud/AWS), Kubernetes & Docker, Database Reliability Engineering (DBRE), and architecting stateful systems using Oracle, PostgreSQL, MongoDB, and ClickHouse. I build Observability-Driven cultures utilizing Zabbix, Prometheus, and Grafana, drive CI/CD excellence using Jenkins, DroneCI, and ArgoCD, and automate remediation with Ansible and Scripting. I lead and mentor high-performance engineering teams to solve complex scalability challenges, driving FinOps efficiency, robust security (WAF/IAM), and operational excellence through automation. Proven track record of maintaining 99.95% availability, achieving 30% TCO reduction through FinOps practices, and mentoring over 500 professionals in the database ecosystem.
Technical lead for cloud operations serving 100+ business clients with approximately 40,000 vehicles monitored 24x7. Engineered high-availability architectures and automated remediation workflows.
Technical leadership of a team serving 300+ clients across various sectors such as retail, healthcare, and industry.
Acted as Technical Lead for cloud infrastructure administration for companies like SevenBoys / Wickbold (one of the country's largest bread manufacturers) and Unimed (one of the largest health insurance networks in Brazil).