Marcos Orsi Ragazzo do Prado

Available to hire

Hi, I’m Marcos Orsi Ragazzo do Prado. I’m a seasoned SRE/DevOps and Kafka specialist with 16+ years in IT. I’m passionate about deploying, scaling, and automating Apache Kafka and Confluent clusters to support high-throughput, multi-tenant environments.

I’ve led multi-cloud migrations (AWS, Azure, OCI), built robust CI/CD pipelines, and designed reliable, scalable architectures. I enjoy collaborating with development and security teams to deliver resilient, production-grade systems.

Experience Level

Expert

Language

Portuguese: Fluent
English: Advanced

Work Experience

SRE/DevOps External Consultant (Kafka Engineer) at Self-Employed / Freelance
November 1, 2023 - Present
Delivered advanced Kubernetes and Apache Kafka solutions as an external consultant for high-throughput, multi-tenant environments, with clients including Volvo Cars (current), Siigo, Alternative Payments, and Michelin Connected Fleet. Supported and optimized large-scale Kafka streaming systems, covering broker management, connectors, transformers, and Kafka Connect deployment. Led the migration of production microservices (throughput >80 MB/s in, >250 MB/s out) with zero data loss, and designed high-throughput pipelines. Automated Kafka and Linux infrastructure on AWS, Azure, and OCI with custom Ansible playbooks, reducing operational work and accelerating delivery. Authored comprehensive Kafka documentation and improved observability and reliability across Kubernetes platforms.
Data Reliability Engineer at Hotelbeds
November 1, 2022 - November 1, 2023
Managed geo-distributed Apache Kafka clusters across Madrid (on-prem), AWS Ireland, and GCP Asia. Designed Terraform modules for Amazon MSK, RDS, and ElastiCache to accelerate provisioning. Planned and executed Kafka migrations to Amazon MSK/EC2 with zero downtime, integrated Metabase for analytics, and enhanced security with IAM authentication. Led upgrades and migrations of the Strimzi Operator, Apache Kafka, Kafka Connect, and ZooKeeper with minimal disruption; improved team onboarding and knowledge transfer.
Kafka Engineer (Freelancer) at DevOps Team
January 1, 2023 - May 31, 2023
Supported strategic decisions to optimize a production MSK deployment; planned tiered storage (up to 12 TB per node) and ran PoCs; authored documentation and onboarding materials; led Kafka Connect and Kafka topic provisioning via Terraform.
Site Reliability Engineer at Evolution
November 1, 2021 - November 1, 2022
Supported a large geo-distributed production environment for a global leader in online gaming solutions. Automated patching, upgrades, and CI/CD pipelines; ensured high availability, scaling, and disaster recovery readiness. Ran Kubernetes clusters on-prem and in public cloud. Led Terraform adoption to provision infrastructure across AWS, Azure, and OpenShift; improved monitoring with Grafana/Prometheus and raised reliability and operational efficiency across teams.
Site Reliability Engineer at Tivit
July 1, 2020 - January 31, 2021
Dedicated to mission-critical operations for one of Brazil's largest financial companies (Celio S.A.), responsible for reliability and performance in a high-availability environment. Designed and implemented HA/DR strategies and a patching cadence, and collaborated across CI/CD, Development, and Security teams. Built and maintained Terraform modules for cloud infrastructure to improve reliability and scalability.
Infrastructure Analyst at Aegea / GSS
December 1, 2016 - May 1, 2018
Maintained mission-critical billing servers and related systems across production, QA, and development environments; supported operations, infrastructure modernization, and reliability and performance improvements.
IT Support Specialist at Tecnomaq Tecnologia
November 1, 2008 - December 31, 2016
Help desk and IT support spanning hardware, software, and network issues, progressing from Help Desk (2009-2010) to IT Support Analyst (2010-2012) and IT Support Specialist (2012-2016).

Education

Postgraduate in Business Administration at IBEFGV
2016
Graduate in Computer Networking at UNIP
2011

Qualifications

CKA (Certified Kubernetes Administrator)
CKAD (Certified Kubernetes Application Developer)
Confluent Certified Administrator for Apache Kafka
Confluent Certified Developer for Apache Kafka
Oracle Cloud Certification: Architect Associate
Oracle Cloud Certification: Architect Professional
Red Hat Delivery Specialist Platform / OpenShift Container Platform Support
Red Hat Sales Engineer Specialist

Industry Experience

Software & Internet, Professional Services, Financial Services, Gaming, Media & Entertainment