Hi, I'm Siyuan Liu, a data scientist with a Master of Data Science and Artificial Intelligence from the University of Waterloo and a Bachelor of Science in Computer Science from the University of Alberta. I combine strong statistical modeling, ML engineering, and data visualization to translate complex data into actionable insights. I enjoy building end-to-end analytics solutions that empower teams to make data-driven decisions. I'm passionate about scalable forecasting, interactive dashboards, and automating data workflows. My experience ranges from developing SHAP-enabled forecasting models and Python data-pipelines at IPEX to conducting ML research on phrase detection at the University of Alberta. I look forward to applying my skills to real-world problems and delivering measurable business impact.

Siyuan Liu

Hi, I'm Siyuan Liu, a data scientist with a Master of Data Science and Artificial Intelligence from the University of Waterloo and a Bachelor of Science in Computer Science from the University of Alberta. I combine strong statistical modeling, ML engineering, and data visualization to translate complex data into actionable insights. I enjoy building end-to-end analytics solutions that empower teams to make data-driven decisions. I'm passionate about scalable forecasting, interactive dashboards, and automating data workflows. My experience ranges from developing SHAP-enabled forecasting models and Python data-pipelines at IPEX to conducting ML research on phrase detection at the University of Alberta. I look forward to applying my skills to real-world problems and delivering measurable business impact.

Available to hire

Hi, I’m Siyuan Liu, a data scientist with a Master of Data Science and Artificial Intelligence from the University of Waterloo and a Bachelor of Science in Computer Science from the University of Alberta. I combine strong statistical modeling, ML engineering, and data visualization to translate complex data into actionable insights. I enjoy building end-to-end analytics solutions that empower teams to make data-driven decisions.

I’m passionate about scalable forecasting, interactive dashboards, and automating data workflows. My experience ranges from developing SHAP-enabled forecasting models and Python data-pipelines at IPEX to conducting ML research on phrase detection at the University of Alberta. I look forward to applying my skills to real-world problems and delivering measurable business impact.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent

Work Experience

Data Scientist, Co-op at IPEX Inc.
May 1, 2024 - August 1, 2024
Developed interactive Power BI dashboards to analyze the company’s supply chain and uncover data patterns. Built end-to-end data pipelines using Python, SQL (PostgreSQL), and pandas to extract, clean, transform, and aggregate large datasets into modeling-ready dataframes; applied ARIMA-based time series modeling to handle missing and irregular observations, enabling statistical consistency for downstream forecasting models. Built SHAP-enabled forecasting models (Random Forest, LightGBM) that assessed the contribution of key drivers to model predictions; conducted A/B testing, out-of-sample validation to ensure model reliability across different demand conditions; designed an automation pipeline to streamline warehouse order lead time forecasting. Delivered advanced forecasting solutions that improved warehouse planning efficiency with a reduced manual workload, driving ~$10M savings in estimated annual business impact.
Machine Learning Research Assistant at University of Alberta
March 1, 2022 - August 1, 2022
Developed machine learning models on the task of phrase detection (classification) using TensorFlow and Transformers, with a focus on infrastructure experimentation, performance benchmarking and error analysis. Created computationally efficient word embedding representations and parallelized the training workflow to reduce training and tuning time across large-scale text documents. Conducted systematic comparisons across multiple corpus and model variants, ensuring cross-corpus compatibility and improving performance by 3% on average in F1-Score while maintaining model stability. Produced performance visualizations using Matplotlib to inform technical decisions on model structure and tuning.

Education

Master of Data Science and Artificial Intelligence (Co-op) at University of Waterloo
September 1, 2023 - April 1, 2025
Bachelor of Science (Honours) at University of Alberta
September 1, 2018 - April 1, 2023

Qualifications

Canadian Information Processing Society Scholarship
January 11, 2030 - January 28, 2026

Industry Experience

Software & Internet

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more