Hi, I’m Mahfuzealahi Noman, a Senior Data Software Engineer based in Kuala Lumpur, Malaysia. I specialize in designing scalable, resilient data solutions that power business intelligence, advanced analytics, and Gen AI initiatives. I have a proven track record of delivering efficient data pipelines, data quality frameworks, and cost-optimized architectures across cloud platforms and multi-source environments. I’m passionate about transforming big data into actionable insights, applying AI innovations to enhance decision-making, and mentoring forward-thinking teams to accelerate data-driven outcomes.

Mahfuzealahi Noman

Hi, I’m Mahfuzealahi Noman, a Senior Data Software Engineer based in Kuala Lumpur, Malaysia. I specialize in designing scalable, resilient data solutions that power business intelligence, advanced analytics, and Gen AI initiatives. I have a proven track record of delivering efficient data pipelines, data quality frameworks, and cost-optimized architectures across cloud platforms and multi-source environments. I’m passionate about transforming big data into actionable insights, applying AI innovations to enhance decision-making, and mentoring forward-thinking teams to accelerate data-driven outcomes.

Available to hire

Hi, I’m Mahfuzealahi Noman, a Senior Data Software Engineer based in Kuala Lumpur, Malaysia. I specialize in designing scalable, resilient data solutions that power business intelligence, advanced analytics, and Gen AI initiatives. I have a proven track record of delivering efficient data pipelines, data quality frameworks, and cost-optimized architectures across cloud platforms and multi-source environments.

I’m passionate about transforming big data into actionable insights, applying AI innovations to enhance decision-making, and mentoring forward-thinking teams to accelerate data-driven outcomes.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent
Bengali
Fluent

Work Experience

Senior Data Software Engineer at EPAM Systems
October 1, 2023 - November 22, 2025
Implemented an agentic AI solution with Python, FastAPI, LangGraph, LangChain, and Neo4j to automate data lineage and KPI extraction across Azure Data Factory, Synapse, and Databricks, improving lineage visibility and reducing data analysis effort for Data Ops and Project Managers by 30%. Architected and implemented a robust data quality framework and alerting system for business-critical datasets, saving manual effort by 70% and improving data quality for decision-making. Designed and developed end-to-end data processing pipelines consolidating multi-source datasets from data warehouse and external APIs, increasing data loading efficiency by 30%. Built a metadata-driven ingestion framework using Azure Data Factory, SQL DB, Logic Apps, Databricks, and SharePoint to handle diverse sources (SAP, Snowflake), reducing new pipeline development time by 60%. Optimized data storage and backup strategy in Azure Data Lake Storage, reducing storage costs by 25%. Migrated QlikView data processing
Senior Data Software Engineer at Sdn Bhd
September 1, 2023 - September 1, 2023
Developed ETL processes from SQL and NoSQL databases to BigQuery, and reverse ETL pipelines using Python with containerized workflow (Docker) for orchestration in Google Cloud Composer. Improved data quality testing capabilities by implementing Great Expectations. Implemented IAM access management based on team roles for GCP using Terraform, GitHub, and Atlantis. Implemented Change Data Capture (CDC) from PostgreSQL to BigQuery using FiveTran and contributed to cost optimization through data retention strategies and slot allocation for Cloud Composer and Looker. Designed and implemented data pipelines with Python (GeoPandas and PyProj) to stream and load location data into Oracle Spatial. Developed web scraping scripts with Python, Docker, Selenium and Oracle. Collaborated with cross-functional teams to gather requirements and prepare data API contracts. Implemented CI/CD practices via GitLab for ETL pipelines and backend.
Senior Data Software Engineer at Sdn Bhd
October 1, 2021 - October 1, 2021
Designed and developed ETL pipelines based on business requirements with PySpark, Hadoop, and AWS (S3, EMR, Lambda, Kinesis, and others). Maintained CRUD operations on AWS Redshift and MongoDB. Developed, maintained, and deployed RESTful APIs to query Big Data with Python (Flask-RESTful, Sanic) on AWS (EC2, Redshift). Performed geospatial analysis and visualization with Python, PostgreSQL, Apache Sedona, GeoPandas, and Kepler.gl. Conducted qualitative and quantitative analyses on new data sources and third-party APIs. Built automated data extraction and transformation back-end processes with Python, MongoDB, AWS Redshift and UNIX cron. Provided support and automation scripts for Market Analysts and Data Analysts using Python and AWS (Redshift, S3) and MongoDB. Adopted CI/CD best practices via GitLab for ETL pipelines and platform backend.
September 1, 2023 - September 1, 2023
Developed ETL processes from SQL and NoSQL databases to BigQuery, and reverse ETL using Python; containerized workflows with Docker for orchestration within Google Cloud Composer; enhanced data quality testing with Great Expectations. Implemented IAM access management for GCP using Terraform, GitHub, and Atlantis. Implemented Change Data Capture from PostgreSQL to BigQuery using FiveTran; contributed to BigQuery cost optimization with dedicated slot allocation for Cloud Composer and Looker. Designed and implemented data pipelines to ingest location data into Oracle Spatial; created web-scraping scripts with Python. Collaborated with cross-functional teams to gather requirements and prepare data API contracts.
at Sdn Bhd
October 1, 2021 - October 1, 2021
Designed and developed ETL pipelines based on business requirements with Python and NoSQL; maintained CRUD operations on AWS Redshift and MongoDB. Developed, maintained, and deployed RESTful APIs to query Big Data with Python, Flask-RESTful, Sanic, and AWS (EC2, Redshift). Performed geospatial analysis and visualization using Python, PostgreSQL, Apache Sedona and geolib. Contributed to data API contracts and data quality improvements.
Senior Data Software Engineer at EPAM Systems
October 1, 2023 - November 23, 2025
Implemented an Agentic AI solution with Python, FastAPI, LangGraph, and Neo4j to automate data lineage and KPI extraction across Azure Data Factory, Synapse, and Databricks, improving lineage visibility and reducing analysis effort for Data Ops and Project Managers by 30%. Architected and implemented a robust data quality framework and alerting system for business flat files, saving 70% manual effort and enhancing data quality for critical decisions. Designed end-to-end data processing pipelines consolidating multi-source datasets from data warehouse and external APIs, boosting data load efficiency by 30%. Built a metadata-driven ingestion framework using Azure (ADF, SQL DB, Logic Apps), Databricks, and SharePoint to handle diverse sources (e.g., SAP, Snowflake), reducing new pipeline development time by 60%. Optimized data storage and backup strategy in Azure Data Lake Storage, reducing storage costs by 25%. Migrated QlikView data processing to Azure Synapse Analytics, improving syste
Senior Data Software Engineer at EPAM Systems
September 1, 2023 - September 1, 2023
Developed ETL processes from SQL and NoSQL databases to BigQuery, and reverse ETL using Python with containerized orchestration in Google Cloud Composer; enhanced data quality testing capabilities by implementing Great Expectations. Implemented IAM access management based on team roles for GCP using Terraform, GitHub, and Atlantis. Implemented Change Data Capture (CDC) from PostgreSQL to BigQuery using FiveTran, contributing to cost optimization via data retention strategies and dedicated slot allocation for Cloud Composer and Looker. Designed data pipelines using Python (Geo Pandas and PyProj) for processing and loading location data into Oracle Spatial. Developed web scraping scripts with Python, Docker, Selenium, and Oracle. Collaborated with cross-functional teams to gather requirements and prepare data API contracts.
ETL Developer / Data Engineer at SDN BHD
October 1, 2021 - October 1, 2021
Designed and developed ETL pipelines based on business requirements using PySpark, Hadoop, and AWS (S3, EMR, Lambda, Kinesis). Maintained and performed CRUD operations on AWS Redshift and MongoDB. Developed, maintained, and deployed RESTful APIs to query Big Data with Python, Flask-RESTful, Sanic, and AWS (EC2, Redshift). Conducted geospatial analysis and built visualizations with Python, PostgreSQL, PostGIS, and Kepler.gl. Led quantitative and qualitative analyses of new data sources and third-party APIs alongside feature engineering.

Education

Bachelor of Computer Science at International Islamic University Malaysia
September 1, 2016 - August 1, 2020
Bachelor of Computer Science at International Islamic University Malaysia
September 1, 2016 - August 1, 2020
Bachelor of Computer Science at International Islamic University Malaysia
September 1, 2016 - August 1, 2020

Qualifications

Azure Data Engineer Associate
January 1, 2024 - November 22, 2025
IIUM Dean's Award
January 1, 2017 - December 31, 2020
Oracle Certified: Oracle Academy Certification
January 1, 2019 - November 22, 2025
Automation Anywhere Certified Advanced RPA Professional
January 1, 2020 - November 22, 2025
Microsoft Certifications: Azure Fundamentals
January 1, 2020 - November 22, 2025
Microsoft Certifications: Dynamics 365 Fundamentals
January 1, 2020 - November 22, 2025
Microsoft Certifications: Power Platform Fundamentals
January 1, 2020 - November 22, 2025
Azure Data Engineer Associate
January 1, 2024 - November 22, 2025
Automation Anywhere Certified Advanced RPA Professional
January 1, 2020 - November 22, 2025
Microsoft Certification: Azure Fundamentals
January 1, 2020 - November 22, 2025
Microsoft Certification: Dynamics 365 Fundamentals
January 1, 2020 - November 22, 2025
Microsoft Certification: Power Platform Fundamentals
January 1, 2020 - November 22, 2025
Oracle Academy Certification
January 1, 2019 - November 22, 2025
Azure Data Engineer Associate
January 1, 2024 - November 23, 2025
Automation Anywhere Certified Advanced RPA Professional
January 1, 2020 - November 23, 2025
Oracle Academy Certification
January 1, 2019 - November 23, 2025
IIUM Dean's Award
January 1, 2017 - January 1, 2020
Microsoft Certifications (Dynamics / Power Platform etc.)
January 1, 2020 - November 23, 2025

Industry Experience

Software & Internet, Professional Services, Computers & Electronics