Hi, I’m Janei Araujo, a Site Reliability Engineer and DevOps specialist focused on automating infrastructure, improving security and reliability across AWS environments. I lead multi-account governance, manage Kubernetes (EKS) clusters with Karpenter, ALB Controller and Helm, and build robust CI/CD pipelines with GitHub Actions. I also design and implement observability using Prometheus, Grafana and Alertmanager, and I excel at translating complex systems into scalable, maintainable solutions while collaborating closely with development teams to deliver reliable software. I thrive in crisis situations, drive performance optimizations and cost-control initiatives, and enjoy continuous improvement of processes and automation. Based in Rio de Janeiro, I value clear communication, teamwork, and delivering value through resilient architectures and automated workflows for globally distributed projects.

Janei Araujo

Hi, I’m Janei Araujo, a Site Reliability Engineer and DevOps specialist focused on automating infrastructure, improving security and reliability across AWS environments. I lead multi-account governance, manage Kubernetes (EKS) clusters with Karpenter, ALB Controller and Helm, and build robust CI/CD pipelines with GitHub Actions. I also design and implement observability using Prometheus, Grafana and Alertmanager, and I excel at translating complex systems into scalable, maintainable solutions while collaborating closely with development teams to deliver reliable software. I thrive in crisis situations, drive performance optimizations and cost-control initiatives, and enjoy continuous improvement of processes and automation. Based in Rio de Janeiro, I value clear communication, teamwork, and delivering value through resilient architectures and automated workflows for globally distributed projects.

Available to hire

Hi, I’m Janei Araujo, a Site Reliability Engineer and DevOps specialist focused on automating infrastructure, improving security and reliability across AWS environments. I lead multi-account governance, manage Kubernetes (EKS) clusters with Karpenter, ALB Controller and Helm, and build robust CI/CD pipelines with GitHub Actions. I also design and implement observability using Prometheus, Grafana and Alertmanager, and I excel at translating complex systems into scalable, maintainable solutions while collaborating closely with development teams to deliver reliable software.

I thrive in crisis situations, drive performance optimizations and cost-control initiatives, and enjoy continuous improvement of processes and automation. Based in Rio de Janeiro, I value clear communication, teamwork, and delivering value through resilient architectures and automated workflows for globally distributed projects.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

Portuguese
Fluent
English
Advanced

Work Experience

Site Reliability Engineer at Equinix
January 1, 2023 - Present
Lead infrastructure automation, security, and reliability across AWS cloud environments, with a focus on scalability and multi-account governance. Responsibilities include Terraform-based multi-account automation, managing EKS clusters with Karpenter, ALB Controller, and Helm; implementing CI/CD pipelines via GitHub Actions; secrets management with AWS Secrets Manager and SSM; environment monitoring with Prometheus, Grafana, Alertmanager; real-time incident response and troubleshooting; creation of custom Docker images for pipelines; governance and cost control per environment/project. Achievements include a 70% reduction in deployment time, increased stability by isolating namespaces and node groups, and zero downtime during planned EKS upgrades.
Senior Information Technology System Analyst at Equinix
January 1, 2023 - October 10, 2025
Led crisis response to quickly restore customers' operations, identified improvements in managed environments, maintained equipment per best practices, and performed capacity analyses to forecast resource upgrades. Responsible for documentation and maintenance of client servers within contracted hosting management offerings.
Information Technology System Analyst at Equinix
January 1, 2022 - October 10, 2025
Acted in crises to rapidly restore operations, identified opportunities for improvements in managed environments, ensured equipment was configured according to industry best practices, and conducted capacity analyses to predict resource upgrades. Performed maintenance and documentation of client servers under hosting management products.
DevOps Engineer at Korporate Solution Factory
August 1, 2025 - October 10, 2025
Contributed to the development and maintenance of mission-critical environments with emphasis on automation, observability, and cloud application scalability. Implemented and optimized CI/CD pipelines, managed Kubernetes (EKS), provisioned IaC with Terraform across multiple AWS accounts, applied cost control and FinOps practices, and worked on monitoring/observability with Zabbix, Prometheus, and Grafana. Managed GitHub Actions pipelines for staging and production deployments; designed resilient architectures with Load Balancers, Karpenter, ASG, and high-availability strategies; focused on security/compliance including secrets management, CloudTrail auditing, WAF, and backup/recovery automation.
IT Consultant at Mec Tecnologia
March 1, 2019 - October 10, 2025
Advised on IT solutions, focusing on infrastructure and operations improvements within Brazil's market; delivered technical consultancy and implementation support.
IT Support at Atento
September 1, 2016 - October 10, 2025
Provided client support services, assisting with technical issues and ensuring smooth operation of IT-related tasks.
Site Reliability Engineer at Equinix
January 1, 2023 - Present
Lead infrastructure automation across multi-account AWS, focusing on scalability, performance, and reliability. Manage EKS clusters with Karpenter, ALB Controller, and Helm; implement CI/CD pipelines with GitHub Actions across multiple environments and clients. Handle secrets and sensitive variables with AWS Secrets Manager and SSM; monitor environments with Prometheus, Grafana, and Alertmanager, with Slack integration. Drive continuous security improvements through dynamic IAM roles using AssumeRole and STS. Provide real-time support to development teams for troubleshooting, log analysis, and incident response. Create and maintain custom Docker images for pipelines and critical environments. Govern environment and project costs across the portfolio; achieved a 70% reduction in deployment time and zero downtime during planned EKS upgrade windows.
Senior Information Technology System Analyst
January 1, 2023 - October 10, 2025
Led crisis response to rapidly restore client operations and corrected vulnerabilities. Identified opportunities for improvements in managed environments, ensured equipment configuration aligned with best market practices, and performed capacity analysis to forecast resource upgrades. Managed maintenance and documentation of client servers under Hosting Management product contracts.
Information Technology System Analyst
January 1, 2022 - October 10, 2025
Delivered crisis response to restore customer operations, corrected vulnerabilities, and implemented improvements in managed environments. Maintained infrastructure in line with industry best practices; performed capacity analyses to forecast upgrades and managed client server documentation under hosting contracts.
DevOps Engineer at Korporate Solution Factory
August 1, 2025 - October 10, 2025
Contributed to development and maintenance of mission-critical environments with emphasis on automation, observability, and cloud application scalability. Implemented and optimized CI/CD pipelines; managed Kubernetes (EKS) clusters; provisioned infrastructure as code with Terraform; integrated multiple AWS accounts with cost control and FinOps practices. Implemented monitoring and observability using Zabbix, Prometheus, and Grafana; managed GitHub Actions pipelines across staging and production. Designed resilient architectures leveraging Load Balancers, Karpenter, Auto Scaling Groups, and high availability. Focused on security and compliance including secrets management with AWS Secrets Manager, CloudTrail audit monitoring, WAF configuration, and automation of backup/recovery.
IT Consultant at Mec Tecnologia
March 1, 2019 - October 10, 2025
Provided IT consulting and infrastructure support across client environments, identifying opportunities for improvement and ensuring compliance with industry best practices. Performed capacity analysis, managed hosting services, and maintained servers under contracted solutions; contributed to crisis response and customer satisfaction.
Customer Support at Atento
September 1, 2016 - October 10, 2025
Delivered front-line customer support, troubleshooting issues, and resolving incidents, contributing to service quality and customer satisfaction.
Site Reliability Engineer at Equinix
January 1, 2023 - Present
Lead infrastructure automation across multi-account AWS environments, focusing on scalability, performance and reliability. Implemented Terraform-driven infrastructure, managed EKS clusters with Karpenter, ALB Controller and Helm, and built CI/CD pipelines with GitHub Actions across multiple environments. Implemented secret management with AWS Secrets Manager and SSM, and established environment monitoring with Prometheus, Grafana, and Alertmanager (with Slack integration). Drove continuous security improvements via dynamic IAM roles (AssumeRole/STS), provided real-time production support, and created custom Docker images for pipelines and critical environments. Oversaw governance and cost control per environment/project; achieved a 70% reduction in deployment time and zero downtime during EKS upgrades.
Senior Information Technology System Analyst at Equinix
January 31, 2023 - October 10, 2025
Crisis response and remediation of vulnerabilities to quickly restore customer operations. Identified opportunities for improvements in managed environments, maintained equipment aligned with best practices, and conducted capacity analysis predicting resource upgrades. Managed documentation and maintenance of client servers within Hosting Management products and ensured compliance with product descriptions.
Information Technology System Analyst at Equinix
January 31, 2022 - October 10, 2025
Identified opportunities for improvements in managed environments, corrected vulnerabilities, and performed capacity analysis to predict resource upgrade needs. Maintained client servers and hosting environments in accordance with product descriptions and market practices; produced documentation and ensured operational readiness.
DevOps Engineer at Korporate Solution Factory
August 31, 2025 - October 10, 2025
Contributed to development and maintenance of mission-critical environments with a strong focus on automation, observability and cloud application scalability. Implemented and optimized CI/CD pipelines, managed Kubernetes (EKS), provisioned infrastructure across multiple AWS accounts, and applied cost control and FinOps practices. Worked on monitoring and observability with tools such as Zabbix, Prometheus and Grafana, while managing GitHub Actions pipelines to orchestrate deployments across staging and production. Designed resilient architectures with Load Balancers, Karpenter, Auto Scaling Groups and namespace isolation. Emphasized security/compliance with IAM, WAF, CloudTrail, and automated backup/recovery processes.
IT Consultant at Mec Tecnologia
March 31, 2019 - October 10, 2025
Provided IT consulting services focused on system optimization, vulnerability remediation, and best-practice configuration for managed environments. Performed capacity analysis, documentation, and ongoing maintenance for client servers within Hosting Management products.
Customer Support at Atento
September 30, 2016 - October 10, 2025
Supported customers with timely issue resolution and service guidance, contributing to service quality and customer satisfaction during a rapid growth period.

Education

Bachelor's at Universidade Estácio de Sá
January 1, 2018 - January 1, 2020
Graduação em Defesa Cibernética at Universidade Estácio de Sá
January 1, 2018 - January 1, 2020
Graduação at Universidade Estácio de Sá
January 1, 2018 - December 31, 2020

Qualifications

Oracle Cloud Infrastructure Foundations 2021 Associate
January 1, 2021 - October 10, 2025
Oracle Cloud Infrastructure Foundations 2022 Associate
January 1, 2022 - October 10, 2025
GitHub Copilot: Formação Básica
January 11, 2030 - October 10, 2025
GitHub Actions: Formação Básica
January 11, 2030 - October 10, 2025
IBM Cloud Essentials V3
January 11, 2030 - October 10, 2025
Oracle Cloud Infrastructure Foundations 2021 Associate
January 1, 2021 - October 10, 2025
Oracle Cloud Infrastructure Foundations 2022 Associate
January 1, 2022 - October 10, 2025
GitHub Copilot: Formação Básica
January 11, 2030 - October 10, 2025
GitHub Actions: Formação Básica
January 11, 2030 - October 10, 2025
IBM Cloud Essentials V3
January 11, 2030 - October 10, 2025
Oracle Cloud Infrastructure Foundations 2021 Associate
January 1, 2021 - October 10, 2025
Oracle Cloud Infrastructure Foundations 2022 Associate
January 1, 2022 - October 10, 2025
IBM Cloud Essentials V3
January 11, 2030 - October 10, 2025
GitHub Copilot: Formação Básica
January 11, 2030 - October 10, 2025
GitHub Actions: Formação Básica
January 11, 2030 - October 10, 2025

Industry Experience

Software & Internet, Professional Services, Telecommunications, Computers & Electronics