Hi, I’m Zohaib Amir. I’m a Lead Data Engineer with 7+ years of experience designing and scaling cloud-native data ecosystems. I specialize in building end-to-end data platforms using PySpark, Airflow, and DBT across Redshift, Snowflake, Synapse, and BigQuery to empower data-driven decision making. I’ve delivered multi-cloud strategies (AWS, Azure, GCP) focusing on governance, reliability, and cost efficiency, while mentoring cross-regional teams and partnering with stakeholders to turn complex data into actionable insights and AI-powered products.

Zohaib Aamir

Hi, I’m Zohaib Amir. I’m a Lead Data Engineer with 7+ years of experience designing and scaling cloud-native data ecosystems. I specialize in building end-to-end data platforms using PySpark, Airflow, and DBT across Redshift, Snowflake, Synapse, and BigQuery to empower data-driven decision making. I’ve delivered multi-cloud strategies (AWS, Azure, GCP) focusing on governance, reliability, and cost efficiency, while mentoring cross-regional teams and partnering with stakeholders to turn complex data into actionable insights and AI-powered products.

Available to hire

Hi, I’m Zohaib Amir. I’m a Lead Data Engineer with 7+ years of experience designing and scaling cloud-native data ecosystems. I specialize in building end-to-end data platforms using PySpark, Airflow, and DBT across Redshift, Snowflake, Synapse, and BigQuery to empower data-driven decision making.

I’ve delivered multi-cloud strategies (AWS, Azure, GCP) focusing on governance, reliability, and cost efficiency, while mentoring cross-regional teams and partnering with stakeholders to turn complex data into actionable insights and AI-powered products.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
See more

Language

English
Fluent

Work Experience

Lead Data Engineer at ILI Digital
January 1, 2025 - November 22, 2025
Consulted for a major European pharmaceutical e-commerce client, redesigning their Amazon Redshift data warehouse, cutting costs by 30% and eliminating reporting lag across hundreds of stakeholders.
Principal Data Engineer at ILI Digital
December 31, 2024 - December 31, 2024
Built and optimized DBT pipelines for a Snowflake-based data platform; enabled KPI calculations for work orders and supply chain; explored Ollama-based GPT AI features for personalized skincare recommendations; developed a document ingestion pipeline leveraging Azure Document Intelligence to extract, transform, and structure data for downstream AI model inference, accelerating data readiness and reducing manual preprocessing.
Senior Data Engineer at La3eb
January 31, 2024 - January 31, 2024
Integrated data products for La3eb app (gaming, e-commerce, and customer support) into a GCP BigQuery data warehouse. Led migration of legacy ETL jobs to GCP-hosted Airflow, integrating SAP, Genesys PureCloud, Getstream, ConnexEase, and Gameball for BI analytics. Built live dashboards for performance tracking and developed a community anti-spam bot.
Azure Data Engineer at Avanceon MEA
February 28, 2022 - February 28, 2022
Designed and implemented a cloud-based industrial data solution for high-frequency streaming data, optimizing processing and analysis. Streamlined ETL for a work order management system, automating tasks and reducing manual workload by 25%. Built scalable ETL via Azure Data Factory, collaborating with business and data analysts for real-time processing from hundreds of fuel stations. Contributed to data warehousing and visualization projects for major fuel retailers.
Senior Data Engineer at Odyssey Analytics
October 31, 2022 - October 31, 2022
Designed a scalable Big Data Lake on AWS using S3, Redshift, Lambda, Airflow, and Spark to support crypto price prediction pipelines, reducing data processing time and enabling faster decision making. Built real-time streaming pipelines with Kafka, Kinesis, Spark, and Redshift for millions of events daily and sub-second inferences. Led agile ceremonies and coordinated with stakeholders to maintain 100% on-time delivery across multiple data engineering initiatives. Integrated car auction APIs into the data warehouse via AWS ETL pipelines, triaging data availability and automating manual processes.
Senior Data Engineer at ILI Digital
December 1, 2024 - December 1, 2024
Built and optimized DBT pipelines to maintain a scalable Snowflake-based data platform, enabling calculation of critical KPIs for work orders and supply chain performance in the aerospace industry, improving operational visibility and data-driven decision-making. Explored and engineered smart feature pipelines using the Ollama model to power a GPT-based AI engine for generating personalized skincare product recommendations, enhancing recommendation accuracy and user engagement. Built a documentation ingestion pipeline for PDFs, text files, and CSVs, leveraging Azure Document Intelligence to extract, transform, and structure data for downstream AI model inference, accelerating data readiness and reducing manual preprocessing. Engineered a PySpark pipeline to decrypt, normalize, and ingest 100GB+ of data daily, boosting processing efficiency by 60% and ensuring secure downstream access.
Senior Data Engineer at La3eb
January 1, 2024 - January 1, 2024
Integrated La3eb app data products including third-party apps for gaming, e-commerce, and customer support into a GCP BigQuery data warehouse. Led migration of legacy ETL jobs to GCP-hosted Airflow, integrating SAP, Genesys PureCloud, GetStream, ConnexEase, and Gameball for BI analytics. Integrated marketing ads performance metrics from Facebook, Instagram, Twitter, YouTube, and Google Ads using Fivetran. Developed live Looker dashboards in collaboration with multiple teams for performance tracking and built a spam detection bot for community channels.
Senior Data Engineer at ODYSSEY ANALYTICS
October 1, 2022 - October 1, 2022
Designed and implemented a scalable Big Data Lake using AWS S3, Redshift, Lambda, Airflow, and Spark to support crypto rate prediction pipelines, reducing data processing time by 40% and directly enhancing trading decision accuracy. Built real-time streaming pipelines with Kafka, Kinesis, Spark, and Redshift, enabling sub-second model inference and processing millions of events daily. Led agile ceremonies and coordinated with stakeholders, maintaining 100% on-time delivery across multiple data engineering initiatives. Integrated car auction APIs into the data warehouse via AWS ETL pipelines, tripling data availability and automating manual processes.
Azure Data Engineer at AVANCE ON MEA
February 1, 2022 - February 1, 2022
Led the design and implementation of a cloud-based industrial data solution for high-frequency streaming data, optimizing data processing and analysis. Streamlined ETL processes for a work order management system, automating tasks and reducing manual workload by 25%. Developed a scalable ETL solution using Azure Data Factory, collaborating with business and data analysts for real-time data processing from hundreds of fuel stations. Contributed to data warehousing and visualization projects for major fuel retailers, enhancing their data analysis capabilities.

Education

Bachelor of Science (HONS), Computer Science at University of Central Punjab
October 6, 2014 - July 20, 2018

Qualifications

Azure Data Engineer Associate
January 11, 2030 - November 22, 2025
Azure Data Engineer Associate
January 11, 2030 - November 22, 2025

Industry Experience

Software & Internet, Life Sciences, Gaming, Professional Services, Healthcare, Media & Entertainment, Other