I am Rui Bian, a Site Reliability Engineer with hands-on experience automating and operating high-capacity, high-availability systems at scale across distributed multi-cloud environments. I thrive on building resilient infrastructure, reducing toil, and improving MTTR, all while delivering delightful user experiences for global services. I specialize in distributed systems, automation, and cloud infrastructure. My work spans from designing cloud-native microservices to refining observability and incident response, with a focus on scalable, cost-efficient solutions that empower teams to move faster with confidence.

Rui Bian

PRO

I am Rui Bian, a Site Reliability Engineer with hands-on experience automating and operating high-capacity, high-availability systems at scale across distributed multi-cloud environments. I thrive on building resilient infrastructure, reducing toil, and improving MTTR, all while delivering delightful user experiences for global services. I specialize in distributed systems, automation, and cloud infrastructure. My work spans from designing cloud-native microservices to refining observability and incident response, with a focus on scalable, cost-efficient solutions that empower teams to move faster with confidence.

Available to hire

I am Rui Bian, a Site Reliability Engineer with hands-on experience automating and operating high-capacity, high-availability systems at scale across distributed multi-cloud environments. I thrive on building resilient infrastructure, reducing toil, and improving MTTR, all while delivering delightful user experiences for global services.

I specialize in distributed systems, automation, and cloud infrastructure. My work spans from designing cloud-native microservices to refining observability and incident response, with a focus on scalable, cost-efficient solutions that empower teams to move faster with confidence.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
See more

Language

Javanese
Advanced
Aragonese
Advanced

Work Experience

Site Reliability Engineer at Xiaomi Technologies
August 1, 2025 - Present
Maintained 99.99% uptime for high-capacity services for millions of daily users across distributed multi-cloud infrastructure with 24/7 on-call duty. Improved alert, Grafana and ELK-based monitoring, increasing availability and resolution efficiency. Supported Kubernetes deployments, load balancing and domain/proxy configuration. Built automation tools including a Terraform-managed alert system and a Python verification tool to cut toil by over 90%. Performed cost analysis with SQL and Python. Designed and deployed cloud-native Golang microservices using gRPC, achieving <100ms p99 latency with scalable, cost-optimized infrastructure using auto-scaling and spot instances.
Automation Engineer (Internship) at Autodesk
January 1, 2023 - June 1, 2023
Built Jenkins CI/CD pipelines on Azure delivering automated testing infrastructure for 500K+ enterprise users with 99.5% pipeline reliability. Reduced automation execution time by 4x and weekly operational costs by 75% via .NET Core migration and performance optimization. Designed modular C# test framework with 300+ test cases, improving observability and reducing production incidents by 30%. Participated in Agile/DevOps practices including sprint planning, code reviews, and incident post-mortems.

Education

Bachelor of Engineering (Honours), Computer Engineering, Distinction (4.43/5) at National University of Singapore
August 1, 2021 - May 1, 2025
Exchange Program, Computer Science at Australian National University
February 1, 2024 - June 1, 2024
Exchange Program, Computer Science at Australian National University
February 1, 2024 - June 1, 2024

Qualifications

Add your qualifications or awards here.

Industry Experience

Computers & Electronics, Software & Internet, Professional Services, Telecommunications, Media & Entertainment