Available to hire
I’m Alex, a London-based Cloud/Platform/DevOps and Site Reliability Engineer with a track record of delivering scalable, compliant infrastructure across finance, media, and tech customers. I enjoy turning complex requirements into reliable systems—covering SOC 2 Type 2 compliance, disaster recovery, observability, and automation.
Currently founding Platform Engineer at Magentic Labs, I lead multi-cloud design, Terraform and Terragrunt pipelines, and Grafana/Prometheus observability programs. I mentor graduates and share best practices across SRE and cloud automation to help teams move faster, safer, and more sustainably.
Skills
Experience Level
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Language
English
Fluent
Work Experience
Founding Platform Engineer at Magnatic Labs Ltd.
August 1, 2025 - PresentLed foundational platform work to enable SOC 2 Type 2 compliance. Improved Terraform codebase with Terragrunt-based optimisations; delivered an initial PostgreSQL disaster recovery plan; implemented Terraform compliance testing CI guardrails; introduced Argo CD for continuous deployments.
Site Reliability Engineer (Contractor) at Citigroup Global Markets Ltd.
October 1, 2023 - August 1, 2025Delivered scalable Prometheus/Thanos/Grafana observability stack to support established SLIs/SLOs. Enabled service metrics exports and synthetic testing to model availability and latency. Mentored teams on Kubernetes, observability, and Site Reliability Engineering; roadmapped and managed delivery across the initiative.
Lead Site Reliability Engineer at Arriva UK
January 1, 2022 - August 1, 2023Architected AWS multi-region/multi-account network and automation solution to support compartmentalisation and data sovereignty. Delivered ~40% operational cost reduction via S3 storage tiering, EKS deployment migrations to spot instance node groups, and improved resource tagging for actionable billing reports. Led Grafana-based observability for right-sizing AWS managed services and EKS workloads; directed SRE/DevOps coaching for a 40+ person team.
Senior Site Reliability Engineer at Sky UK
August 1, 2020 - January 1, 2022Architected Terraform and Terragrunt automation to deliver multi-account, multi-cloud (AWS and GCP) network governance. Led dual-cloud Kubernetes solution (EKS and GKE) for a multi-tenant hosting environment. Produced and defended proposals for senior leadership and mentored graduates on SRE and cloud automation best practices.
Principal Automation Engineer at Nuance Communications Inc.
January 1, 2009 - October 1, 2019Led globally distributed artifact service supporting deployments, configuration, integrations and monitoring. Built GitLab pipelines for building (Packer) Azure VM images and Terraform modules for multi-tenant infrastructure provisioning. Maintained Foreman, FreeIPA/IDMS and Sensu/Graphite/Graphana monitoring for ~2000 VMs.
Systems Administrator at MoneyAM Ltd.
February 1, 2007 - January 1, 2009Provided foundational systems administration across AIX/Linux and enterprise environments, supporting scale and reliability for financial services teams.
Junior Systems Administrator at WENN Ltd.
June 1, 2006 - February 1, 2007Supported IT operations and infrastructure provisioning in a fast-paced media environment, assisting with incident response and proactive monitoring.
Education
Postgraduate Diploma in Analytical Chemistry at Birkbeck, University of London
October 1, 2014 - September 1, 2019Bachelor of Science (Hons) Molecular Science at The Open University
August 1, 2012 - August 1, 2016Qualifications
Industry Experience
Software & Internet, Financial Services, Media & Entertainment, Telecommunications, Transportation & Logistics, Professional Services
Skills
Experience Level
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Hire Alexander Talbot today
To get started post up your job and then invite Alexander Talbot to your job.