I'm Sara Elzayat, a Site Reliability Engineer with 7+ years of experience building and operating distributed systems, observability platforms, and Kubernetes-based infrastructure in cloud-native environments. I’ve designed and managed scalable data pipelines, implemented multi-cloud architectures, and led reliability initiatives across globally distributed teams. I thrive on automation, monitoring, and performance tuning, and I collaborate with protocol engineers to adapt systems to forks and security changes. I’m passionate about making complex systems reliable, observable, and easy to operate, and I enjoy turning intricate technical challenges into robust, scalable solutions.

Sara Elzayat

Available to hire

Experience Level

Expert: 8 skills
Intermediate: 1 skill

Language

English: Fluent
Arabic: Advanced

Work Experience

Site Reliability Engineer (Remote) at Zondax
February 1, 2024 - November 26, 2025
Build and manage infrastructure spanning Kubernetes, CI/CD, observability, and distributed databases. Provision and maintain Filecoin nodes and services across environments, ensuring uptime and resilience. Design and implement CI/CD pipelines and IaC workflows with GitHub Actions, Terraform, Helm, and Flux. Develop APIs and blockchain indexers in Go for querying, parsing, and processing large-scale distributed data. Build high-throughput gRPC microservices and streaming APIs for real-time blockchain data processing. Operate large-scale pipelines with PostgreSQL and ClickHouse, processing 5 TB+ of EVM data with real-time replication. Design and scale OpenTelemetry-based telemetry pipelines for metrics, logs, and traces across Kubernetes environments. Deploy and automate Prometheus, Grafana, and Loki stacks, standardizing dashboards and alerting. Define observability governance and best practices, enabling faster incident response and reduced cognitive load for engineers. Collaborate with protocol
Software Engineer (Remote, Kubernetes/GitOps & SRE work) at Weaveworks
December 1, 2023 - December 1, 2023
Built and operated Weave GitOps Enterprise, an enterprise-grade Kubernetes platform for continuous delivery, governance, and cluster lifecycle management. Designed and maintained fault-tolerant distributed systems with automated reconciliation and conflict resolution. Built Kubernetes controllers and operators in Go to simplify GitOps workflows and enable automated deployments across multiple clusters. Architected and operated multi-cloud workloads across AWS, GCP, and Azure, implementing monitoring, logging, and alerting pipelines that scale across dozens of clusters. Automated developer workflows to reduce toil and increase reliability, supporting SRE best practices. Implemented integrations with GitHub, GitLab, and Bitbucket for multi-provider authentication and API access.
Cloud Engineer (Hybrid) at Vodafone
April 1, 2020 - April 1, 2020
Re-platformed legacy Vodafone software to Kubernetes for production readiness, enabling container adoption across local markets. Designed, deployed, and supported infrastructure on hybrid public/private cloud environments. Troubleshot and resolved issues across staging and production to ensure reliable service delivery. Assessed cloud providers and applications for migration suitability, guiding cloud strategy decisions. Developed and maintained a Terraform provider for Skytap and private cloud integrations.
Teaching Assistant (Onsite) at October 6 University
October 1, 2016 - October 1, 2016
Delivered weekly sessions in Algorithms and Data Structures, Computer Graphics, Information Systems and Management Information Systems.

Education

Postgraduate Diploma in Cloud Platform Development at Information Technology Institute (ITI)
January 1, 2016 - January 1, 2017
Bachelor's degree in Computer Science at October 6 University
January 1, 2011 - January 1, 2015

Industry Experience

Software & Internet, Telecommunications, Professional Services