I am a Senior Principal DevOps Engineer / SRE with over 14 years of experience managing large-scale, mission-critical production environments. I specialize in Oracle Cloud Infrastructure (OCI), Observability (Prometheus/Grafana stack), and Infrastructure as Code, and I enjoy building automation that reduces toil and accelerates delivery. I thrive at bridging legacy infrastructure with modern containerized cloud solutions and leading cross-functional teams to reliable, scalable outcomes. I currently lead Cloud Operations efforts at Oracle, mentor engineers, and architect enterprise-grade automation suites. I am passionate about reliability, security, and performance, and I love turning complex challenges into repeatable, auditable workflows that empower teams to move faster with confidence.

Vlad Gusyev

I am a Senior Principal DevOps Engineer / SRE with over 14 years of experience managing large-scale, mission-critical production environments. I specialize in Oracle Cloud Infrastructure (OCI), Observability (Prometheus/Grafana stack), and Infrastructure as Code, and I enjoy building automation that reduces toil and accelerates delivery. I thrive at bridging legacy infrastructure with modern containerized cloud solutions and leading cross-functional teams to reliable, scalable outcomes. I currently lead Cloud Operations efforts at Oracle, mentor engineers, and architect enterprise-grade automation suites. I am passionate about reliability, security, and performance, and I love turning complex challenges into repeatable, auditable workflows that empower teams to move faster with confidence.

Available to hire

I am a Senior Principal DevOps Engineer / SRE with over 14 years of experience managing large-scale, mission-critical production environments. I specialize in Oracle Cloud Infrastructure (OCI), Observability (Prometheus/Grafana stack), and Infrastructure as Code, and I enjoy building automation that reduces toil and accelerates delivery. I thrive at bridging legacy infrastructure with modern containerized cloud solutions and leading cross-functional teams to reliable, scalable outcomes.

I currently lead Cloud Operations efforts at Oracle, mentor engineers, and architect enterprise-grade automation suites. I am passionate about reliability, security, and performance, and I love turning complex challenges into repeatable, auditable workflows that empower teams to move faster with confidence.

See more

Experience Level

Expert
Expert
Expert
Expert

Language

English
Fluent

Work Experience

Senior Principal Member of Technical Staff, Cloud Operations at Oracle Corporation
September 1, 2024 - Present
Lead automation architecture and telemetry modernization for OCI production environments. Developed and maintained automation pipelines using TeamCity to manage production tooling updates, reducing manual maintenance overhead. Implemented an Alert Remediator pipeline using Chef to automatically resolve specific production alerts, significantly lowering MTTR. Engineered real-time inventory extraction from OCI and ingestion into Grafana for inventory management and reporting. Investigated replacement strategies for HashiCorp Consul to modernize node and service inventory management. Authored comprehensive telemetry tooling documentation and onboarding materials to accelerate new hire productivity.
Principal Member of Technical Staff, Cloud Operations at Oracle Corporation
January 1, 2021 - September 1, 2024
Architected CI/CD pipelines for the entire observability stack (Prometheus, Alertmanager, Grafana) and multiple exporters, enabling centralized deployment across QA, Staging, and Production via a single repository. Built a fully automated telemetry update suite using TeamCity for centralized management of monitoring agents. Managed cloud assets and security configurations with Terraform; developed custom Chef cookbooks for service installation and configuration. Implemented a Prometheus Alertmanager Helm chart to centralize multi-region configurations. Created Python/PowerShell/Bash scripts to bridge legacy applications with modern monitoring tools.
Manager, Cloud Operations at Oracle Corporation
August 1, 2018 - January 1, 2021
Led a global team of 6 senior staff and system administrators, overseeing a hybrid environment of over 3,000 servers. Directed OCI migrations for critical applications and led technical operations during major compliance audits (HIPAA, SOC, FedRAMP). Modernized observability by integrating apps with central logging and metrics (ELK Stack, Prometheus, Grafana). Managed vulnerability management and patching programs.
Principal Systems Administrator (IC-4) at Oracle Corporation
January 1, 2017 - August 1, 2018
Security and optimization through proactive maintenance; managed DNS (SPF, DKIM, DMARC) for application delivery and verification. Served as primary liaison between Operations and Dev/DevOps to finalize designs for production projects. Standardized recurring tasks and trained junior staff; provided critical data for annual security audits.
Senior Systems Administrator (IC-3) at Oracle Corporation
January 1, 2015 - January 1, 2017
Tool development for internal efficiency (WebServer query tool, F5 Load Balancer tool, self-updating CMDB). Installed and maintained OEM 12/13 stack; created custom metrics and automation for target configuration. Managed large-scale Linux and Windows environments; maintained virtualization stacks (Hyper-V, Oracle VM) and storage appliances.
NOC Supervisor at Proofpoint, Inc.
November 1, 2013 - December 1, 2014
Led a global NOC team (4 engineers); reviewed incidents and reported process metrics. Built new-hire training processes to minimize onboarding time; updated technical documentation. Automated data collection for troubleshooting and auditing.
NOC Engineer at Proofpoint, Inc.
September 1, 2012 - November 1, 2013
Deployed and configured customer server clusters; provisioned Dell PowerEdge servers and KVM virtualization. Executed production changes for DNS, DHCP, MySQL, MSSQL, and SVN; performed emergency system migrations and data recovery.
NOC Engineer at Proofpoint, Inc.
November 1, 2011 - September 1, 2012
Managed production changes; supported DNS, DHCP, MySQL, MSSQL, and SVN; performed data recovery and migrations on Linux environments.
NOC Engineer at GANZ STUDIOS
October 1, 2010 - November 1, 2012
Created Bash automation for backups and a custom JavaScript dashboard for real-time production health monitoring. Troubleshot critical production incidents in a mixed Linux/Windows environment using Orion and OpenNMS.

Education

Diploma in Network Administration and Security at RCC Institute of Technology
January 11, 2030 - February 17, 2026
Diploma in Network Administration and Security at RCC Institute of Technology
January 11, 2030 - February 19, 2026

Qualifications

Oracle Cloud Infrastructure (OCI) AI Foundations Associate
January 11, 2030 - February 17, 2026
Oracle Cloud Infrastructure (OCI) Foundations Associate
January 11, 2030 - February 17, 2026
Oracle Data Platform Foundations Associate
January 11, 2030 - February 17, 2026
Oracle Cloud Infrastructure (OCI) AI Foundations Associate
January 11, 2030 - February 19, 2026
Oracle Cloud Infrastructure (OCI) Foundations Associate
January 11, 2030 - February 19, 2026
Oracle Data Platform Foundations Associate
January 11, 2030 - February 19, 2026

Industry Experience

Software & Internet, Computers & Electronics, Professional Services