George D.





Available to hire

Site Reliability Engineer focused on automation, availability, performance, and security.
Skilled in:
• Infrastructure & Cloud: AWS, GCP, Kubernetes, Terraform, Helm, Docker
• Automation & IaC: Python, Go, GitLab CI/CD, Argo CD, Jenkins, CircleCI
• Observability: Prometheus, Grafana, Mimir, OpenTelemetry, PagerDuty, ELK

I work with Agile teams and focus on reducing manual work, improving uptime, and supporting reliable delivery. Outside of work, I compose music, climb, and travel.

Skills

Experience Level

Expert

Expert

Expert

Expert

Language

English

Fluent

Work Experience

Site Reliability Engineer at Veeva Systems

April 3, 2022 - Present

• Designed and maintained a centralized Infrastructure-as-Code repository with versioned Terraform modules, enforcing modular standards and DRY practices across teams.
• Built GitOps-driven pipelines in GitLab and Argo CD with dynamic builds, parallelized tests, caching, and automated E2E/load testing, improving deployment speed and reliability.
• Implemented ephemeral, feature-specific environments with GitLab, Argo CD, and Helm, delegating non-critical stack configuration to development teams for faster iteration.
• Architected centralized-distributed observability using Grafana Alloy, aggregating logs, metrics, and traces into Loki, Mimir, and Tempo for full-stack visibility.
• Defined and enforced SLAs, SLIs, and SLOs with stakeholders; implemented Prometheus recording rules and dashboards to monitor 1/7/30-day windows and ensure compliance.
• Designed and rolled out Backstage as the Internal Developer Portal (IDP), now adopted product-wide for onboarding, documentation, and service catalogs.
• Standardized monitoring, metrics, and logging across services by developing custom Grafana dashboards and TypeScript modules for health checks, GraphQL metrics, and trace-span injection.

Site Reliability Engineer at Oanda

July 13, 2020 - April 1, 2022

• Architected and deployed an enterprise observability platform (Prometheus, Grafana, Alertmanager, Pushgateway) delivering real-time metrics and resilient alerting across on-prem systems.
• Automated PagerDuty on-call configuration with Terraform IaC, enabling engineers to self-serve incident workflows and reduce operational toil.
• Designed and launched a scalable cloud-native monitoring solution on GCP (Cortex, GKE, Helm, Vault), enhancing security and reliability for production workloads.
• Eliminated toil by automating on-call compensation calculations in Go, integrating PagerDuty + Google Sheets APIs with Cloud Functions and CircleCI.
• Defined and enforced SLOs/SLIs for critical funding services, improving reliability and aligning performance with customer expectations.
• Standardized monitoring deployments across environments with a configuration management toolkit (Ansible, Makefiles, Node.js, GoCD, Jinja), accelerating rollout speed.

Software Engineer at Leonardo

March 25, 2018 - July 9, 2020

• Automated infrastructure provisioning with Python-based IaC (CloudFormation, Kubernetes, Ansible), laying the foundation for scalable, reliable environments.
• Built and maintained GitLab CI/CD pipelines and Kubernetes manifests to ensure consistent, automated deployments across services.
• Implemented observability for Java and Node.js applications with Prometheus, Grafana, and Kibana, improving system reliability and troubleshooting speed.
• Modernized backend systems by contributing to a Spring Boot microservices architecture (Elasticsearch, Kafka, AWS), boosting scalability and maintainability.
• Designed and launched an API Gateway with GraphQL, gRPC, and Docker to unify and secure service access, strengthening platform reliability.
• Led a sprint efficiency initiative across departments, fostering self-organized teams and reducing operational blockers.