Hi, I’m Nikhil Kannoju, a Senior Site Reliability Engineer (SRE) with 7+ years of experience in DevOps, automation, and production support. I specialize in cloud infrastructure, container orchestration, and system monitoring across AWS, Azure, and GCP, with a strong focus on building scalable, reliable platforms and efficient incident response. I thrive in cross-functional teams, designing IaC, CI/CD pipelines, and observability solutions. I enjoy automating cloud operations using Terraform, Ansible, and scripting, and I’m comfortable working across Linux/Windows environments, Kubernetes, and serverless architectures. I’m always eager to learn, document, and mentor others to improve reliability and efficiency.

Nikhil Kannoju

Hi, I’m Nikhil Kannoju, a Senior Site Reliability Engineer (SRE) with 7+ years of experience in DevOps, automation, and production support. I specialize in cloud infrastructure, container orchestration, and system monitoring across AWS, Azure, and GCP, with a strong focus on building scalable, reliable platforms and efficient incident response. I thrive in cross-functional teams, designing IaC, CI/CD pipelines, and observability solutions. I enjoy automating cloud operations using Terraform, Ansible, and scripting, and I’m comfortable working across Linux/Windows environments, Kubernetes, and serverless architectures. I’m always eager to learn, document, and mentor others to improve reliability and efficiency.

Available to hire

Hi, I’m Nikhil Kannoju, a Senior Site Reliability Engineer (SRE) with 7+ years of experience in DevOps, automation, and production support. I specialize in cloud infrastructure, container orchestration, and system monitoring across AWS, Azure, and GCP, with a strong focus on building scalable, reliable platforms and efficient incident response.

I thrive in cross-functional teams, designing IaC, CI/CD pipelines, and observability solutions. I enjoy automating cloud operations using Terraform, Ansible, and scripting, and I’m comfortable working across Linux/Windows environments, Kubernetes, and serverless architectures. I’m always eager to learn, document, and mentor others to improve reliability and efficiency.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert

Work Experience

Sr. DevOps Engineer at 2bCloud
March 1, 2025 - Present
Designed and maintained CI/CD pipelines in Azure DevOps and GitHub for customer applications. Implemented high-availability architectures in Kubernetes (AKS/EKS) with auto-scaling and self-healing. Managed AWS/Azure environments including EC2, VMs, load balancers, and storage. Implemented container orchestration with Kubernetes and Helm. Used Terraform and ARM templates for IaC; observability with Prometheus, Grafana, Azure Monitor, CloudWatch, and Elastic Stack. Automated environment provisioning with Ansible and Terraform; built Terraform and Bicep modules for automation and compliance. Provided L2/L3 support for cloud infrastructure, VPN, NSG/SG troubleshooting, and backup restores. Centralized logging and alerting to reduce MTTR; managed multi-customer cloud environments with cost governance and security. Led incident response, SLO/SLI monitoring, and RCA discussions.
Site Reliability Engineer / Production Support at Morgan Stanley
March 1, 2021 - March 1, 2025
Worked on a financial data warehousing project (FDW/ISGF), handling production order processing across multiple systems and resolving incidents. Led critical batch processing and fixed ETL/Autosys failures. Implemented ITIL processes using ServiceNow and Jira; served as Incident Commander for production incidents and vendor coordination. Managed IAM provisioning, AD integrations, and SSO across AWS/Azure/GCP; enforced security controls (SOC 2, ISO 27001). Configured Prod/QA/UAT environments; performed security audits and MFA enforcement. Wrote SQL queries for data extraction and analysis; used Dynatrace, Splunk, and other monitoring tools. Participated in disaster recovery exercises and application releases; contributed to RCA and post-incident reviews.
DevOps / Application Support at Equifax
June 1, 2019 - February 1, 2021
Supported pre-production and production environments; migrated legacy platforms to AWS/Azure/GCP. Designed and implemented API integrations for cloud services using AWS Lambda, API Gateway, and Azure API Management. Investigated authentication failures, database performance issues, and API outages. Built and maintained CI/CD pipelines with Jenkins, GitHub, and Maven; containerized applications with Kubernetes. Managed Docker/Kubernetes deployments; performed incident, problem, change, and release management in ITIL-aligned workflows (ServiceNow/Jira). Implemented monitoring with Datadog, Dynatrace, Splunk; automated operational tasks with Python.
DevOps / Application Support at Wedge Networks
January 1, 2019 - June 1, 2019
Configured IAM roles, VPN access, and security groups; automated tasks with Terraform/Ansible. Built CI/CD pipelines using Jenkins and containerized workloads with Kubernetes/Docker. Monitored systems with Nagios; developed automation scripts for provisioning and operations. Worked on migration and deployment across platforms, ensuring reliability and secure access.
DevOps Engineer / Application Support at Road trek Motor Homes
January 1, 2018 - December 1, 2018
Implemented CI/CD with Jenkins and automation scripts; migrated and deployed on AWS EC2. Managed private cloud environments using Chef/ Puppet for configuration management. Built and tested API integrations, Dockerized workloads, and Kubernetes-based deployments. Implemented monitoring with Nagios and logging/alerting, enabling scalable production support.
Systems Administrator at Value Labs
December 1, 2012 - September 1, 2016
Managed a large fleet of Linux/Unix servers; implemented automation with Puppet/Ansible. Supported VMware/VSphere environments; performed patching, monitoring, and ITIL-aligned incident response with ServiceNow/JIRA. Built and maintained CI/CD pipelines, containerized workloads, and cloud migrations (AWS/Azure/GCP).

Education

Bachelor of Technology (E.C.E) at Vivekananda Institute of Technology, Hyderabad, India
August 1, 2012 - February 17, 2026
Post-secondary Diploma in Mobile Applications Design and Development at Lambton College, Toronto, ON, Canada
November 1, 2017 - February 17, 2026

Qualifications

Azure Solutions Architect Expert (Az-305)
January 11, 2030 - February 17, 2026
Azure Administrator Associate (Az-104)
January 11, 2030 - February 17, 2026
AWS Certified Solutions Architect
January 11, 2030 - February 17, 2026

Industry Experience

Software & Internet, Financial Services, Professional Services, Computers & Electronics