I am a Data Engineer with over 5 years of hands-on experience designing scalable data platforms, building cloud-native ETL/ELT pipelines, and deploying machine learning solutions. I am proficient with tools such as Google Cloud, Airflow, Spark, and MLOps frameworks. I am currently pursuing a Master of Data Science & Innovation at UTS and seeking Data or AI Engineer roles in Sydney where I can apply my technical expertise and create business impact. I have a strong track record of migrating data warehouses and pipelines across cloud platforms, leading to cost reductions and improved processing times. I am skilled in mentoring, course design, and leading teams. My experience also covers designing big data architectures, real-time data pipelines, and machine learning deployment systems. I am passionate about advancing AI applications in business and optimizing data solutions for efficiency.

Pham Ngoc Quang

Available to hire

Experience Level

Expert

Language

English: Advanced
Vietnamese: Fluent

Work Experience

Data Engineer at VMO Holdings
July 31, 2025 - August 27, 2025
Migrated the data warehouse, data pipelines, and notebooks from Azure Data Factory, Azure Storage, Databricks, and Azure Synapse Analytics to BigQuery, Google Cloud Storage, and Composer, reducing cost by 30% and improving processing time by 40%.
Data Engineer at Cole.vn
May 1, 2025 - Present
Led a team of 3 to build a data platform, develop ETL pipelines, and maintain the system. Built an on-premise system on Kubernetes using a Lakehouse architecture with object storage, MinIO, Iceberg, dbt, and Spark, with Superset and Power BI for visualization. Built a big data system on Google Cloud Platform services including Dataflow, BigQuery, and Cloud Functions. Designed data models and ETL pipelines from raw through bronze, silver, and gold zones. Developed a Customer 360 data platform with a feature store of over 4000 customer attributes grouped by balance, deposit, loan, transaction, and mobile and internet banking. Used dbt and Spark to clean, de-duplicate, and transform data for dashboards and machine learning systems such as churn prediction.
Data Engineer at An Binh Commercial Joint Stock Bank
April 30, 2025 - August 27, 2025
Mentored a Data Analyst on company data visualization projects. Built effective ETL pipelines for analytics using Airflow, PySpark, and dbt. Developed real-time data processing pipelines that handle large data volumes, reducing latency in report submission.
Data Engineer at Vin Big Data Joint Stock Company
September 30, 2024 - August 27, 2025
Used Kafka, Spark Streaming, and Flink to build Change Data Capture (CDC) systems with Debezium for real-time data synchronization. Designed big data systems using Apache Doris, Delta Lake, Airflow, and Superset. Led a team of 5 that won third prize in the Vin Big Data Hackathon. Developed a company-wide employee performance assessment platform that integrates Jira data with the data warehouse, enabling productivity tracking and improved decision-making.
Data / MLOps Engineer at Ftech Co., Ltd
January 31, 2024 - August 27, 2025
Mentored a Data Engineer on big data systems and ETL pipeline development. Designed a big data architecture for the data center using the Hadoop and Spark ecosystem to store and process television data. Developed ETL pipelines for BI reports and real-time data processing, reducing latency. As an MLOps Engineer, researched and developed machine learning pipeline systems using Kubeflow, Feast, DVC, Seldon Core, and Triton to automate data preparation, model training, evaluation, and deployment, reducing manual machine learning pipeline processing time by 60%. As an AI Engineer, led projects applying computer vision, such as face recognition and license plate recognition, to automate construction tasks.
Data Engineer at Viettel Construction Corporation
May 31, 2021 - August 27, 2025
Developed a big data system using the Hadoop and Spark ecosystem for large-scale data storage and processing. Created an ETL pipeline for BI reporting, reducing manual effort by 70%. Designed a dimensional model for a data warehouse storing telecommunication data for OLAP, facilitating data analysis and queries. Optimized SQL and PySpark code, reducing processing time by 60%. Designed a productivity optimization algorithm that automatically schedules employees' daily tasks, helping managers allocate work efficiently and reduce wasted time.
AI Engineer (Internship) at Asilla Inc.
January 31, 2020 - August 27, 2025
Developed an image classifier model to detect NSFW images. Prepared and processed training data and compiled other backbone ML architectures, improving model performance by 10%. Collaborated with the Data Engineering team to develop a system that detects and extracts information from images, addressing limited-data problems in real-time invoice product applications. Compiled a meta ML architecture for the text detection model, improving performance by 5%.
Part-time Research Student at Hanoi University of Science and Technology
November 30, 2018 - August 27, 2025
Conducted research on sensor coverage in the military management domain, increasing area coverage by 5%. Applied and optimized genetic algorithms combined with heuristic methods for sensor placement problems.
Data Engineer Teacher at University of Technology Sydney
June 30, 2027 - August 27, 2025
Designed and delivered comprehensive data engineering courses covering data pipelines, data warehousing, and big data processing. Mentored students on best practices in data engineering including data governance, optimization, and real-world case studies. Assessed student progress through regular evaluations and provided constructive feedback to enhance technical skills and industry readiness.

Education

Master of Data Science and Innovation at University of Technology Sydney
July 1, 2025 - June 30, 2027
Bachelor of Computer Science - Information Technology at Hanoi University of Science and Technology
August 1, 2015 - January 31, 2020

Qualifications

IELTS Certificate
January 1, 2025 - August 27, 2025
TOEIC Certificate
January 1, 2019 - August 27, 2025
Deep Learning Certificate from Coursera
January 1, 2019 - August 27, 2025

Industry Experience

Education, Software & Internet, Financial Services, Telecommunications, Professional Services, Real Estate & Construction