Hi, I’m Brian Chen—a senior backend engineer focused on cloud-native distributed systems, microservices, and large-scale compute orchestration. I led backend and infra development at AWS and Meta, building highly reliable, observable, and performance-critical services powering networks, container scheduling, and AI/ML workloads across millions of machines. With deep expertise in Golang, Python, Kubernetes, gRPC, OpenTelemetry, and cloud platforms, I focus on scalability, reliability, and developer productivity. I work closely with networking, security, and ML-driven traffic analysis teams to ensure simulation fidelity aligns with production workloads.

Hi, I’m Brian Chen—a senior backend engineer focused on cloud-native distributed systems, microservices, and large-scale compute orchestration. I led backend and infra development at AWS and Meta, building highly reliable, observable, and performance-critical services powering networks, container scheduling, and AI/ML workloads across millions of machines. With deep expertise in Golang, Python, Kubernetes, gRPC, OpenTelemetry, and cloud platforms, I focus on scalability, reliability, and developer productivity. I work closely with networking, security, and ML-driven traffic analysis teams to ensure simulation fidelity aligns with production workloads.

Available to hire

Hi, I’m Brian Chen—a senior backend engineer focused on cloud-native distributed systems, microservices, and large-scale compute orchestration. I led backend and infra development at AWS and Meta, building highly reliable, observable, and performance-critical services powering networks, container scheduling, and AI/ML workloads across millions of machines.

With deep expertise in Golang, Python, Kubernetes, gRPC, OpenTelemetry, and cloud platforms, I focus on scalability, reliability, and developer productivity. I work closely with networking, security, and ML-driven traffic analysis teams to ensure simulation fidelity aligns with production workloads.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
See more

Language

English
Fluent

Work Experience

Software Engineer at Canva
September 1, 2020 - December 1, 2020
Contributed to Canva's static export pipeline, implementing backend rendering flows for PDF, SVG, and print-quality assets using Golang. Improved concurrency and memory efficiency of export workers by 30% through profiling and refactoring. Enhanced backend-to-front-end parameter mapping to ensure pixel-accurate exported outputs. Built micro UI improvements in React for export settings and preflight validation workflows.
Software Engineer at Meta
January 1, 2021 - February 1, 2023
Built cloud-scale backend services for Meta's container runtime lifecycle including scheduling, admission control, rollout validation, and host health management. Designed fault-tolerant microservice architectures in Golang and Python, interfacing with cgroup v2, eBPF, and Meta's cluster scheduler Twine. Built real-time telemetry-based distributed tracing, metrics, and structured logging, significantly improving observability and root-cause analysis across dozens of microservices. Built operator-facing dashboards (React + TypeScript) to visualize scheduling status, rollout progress, and failure patterns; created internal CLIs and automation services (Node.js/Nest.js) to improve developer productivity and velocity. Collaborated with networking, security, and ML-driven traffic analysis teams.
Senior Backend Engineer at Meta
February 1, 2023 - September 1, 2024
Led backend engineering for container resource isolation and multi-tenant scheduling platforms, supporting AI/ML, GPU, and accelerator-heavy workloads across millions of hosts. Designed and implemented distributed backend services in Golang and Python, interfacing with cgroup v2, eBPF, and Meta's cluster scheduler Twine. Built real-time telemetry and data pipelines powering AI-driven scheduling feedback loops, enabling smarter workload placement and improved cluster utilization. Developed high-throughput metrics ingestion, aggregation, and analysis used by capacity planning and ML optimization teams. Maintained and extended BeLow, Meta's open-source time-aware resource monitor; reduced runtime overhead by 25% and improved display accuracy and sampling efficiency. Delivered backend APIs and data services powering internal observability dashboards for CPU/GPU isolation, contention detection, and anomaly analysis. Built operator-facing dashboards (React + TypeScript) to visualize scheduli
Senior Backend Engineer at Amazon Web Services (AWS)
September 1, 2024 - November 1, 2025
Senior backend engineer leading backend and infra development for a cloud-native distributed network emulation platform used across AWS networking orgs to validate routing, packet forwarding, and configuration changes prior to global rollout. Built event-driven orchestration pipelines enabling deterministic replay of real-world network traffic, multi-hop topologies, and fault-injection scenarios at datacenter scale. Improved end-to-end latency by 45% through asynchronous task pipelines, queue-based execution, optimized serialization, and service decomposition. Implemented REST and gRPC APIs to power internal observability dashboards; designed OpenTelemetry-based distributed tracing, metrics, and structured logging to improve visibility and root-cause analysis across many microservices. Built operator-facing dashboards to visualize topology, status, and failure patterns, accelerating on-call response and debugging. Created internal CLIs and automation services (Node.js / Nest.js) that i
Software Engineer at Google Maps
December 1, 2019 - February 1, 2020
Implemented AR Live View navigation components, integrated real-time sensor fusion and 3D overlay rendering. Optimized latency-sensitive rendering code in Java/Kotlin to ensure smooth AR alignment during navigation. Created internal diagnostic and calibration visualization tools for AR sensor alignment on various devices.

Education

Exchange Semester in Computer Science at The University of Texas at Austin
August 1, 2017 - December 1, 2017
Bachelor of Engineering - Computer Software Engineering at The University of Queensland
August 1, 2012 - December 1, 2016
Exchange Semester in Computer Science at The University of Texas at Austin
August 1, 2017 - December 1, 2017
Bachelor of Engineering - Computer Software Engineering at The University of Queensland
August 1, 2012 - December 1, 2016

Qualifications

National Scholarship
January 11, 2030 - January 16, 2026
Multiple School Scholarships
January 11, 2030 - January 16, 2026
Excellent Student Cadet Award
January 11, 2030 - January 16, 2026
National Scholarship
January 11, 2030 - January 16, 2026
Multiple School Scholarships
January 11, 2030 - January 16, 2026
Excellent Student Cadet Award
January 11, 2030 - January 16, 2026

Industry Experience

Software & Internet, Telecommunications, Professional Services