I’m Mahnoor Zahid, a data engineering consultant with over 4 years of hands-on experience building scalable ETL pipelines on AWS and Databricks. I enjoy turning complex data into reliable, secure analytics platforms and collaborating with cross-functional teams to deliver cloud-native solutions. From migrating HDP to CDP on AWS to designing event-driven real-time data lakes and cross-region ingestion frameworks, I focus on governance, security, and cost optimization while delivering robust data platforms.

Mahnoor Zahid

I’m Mahnoor Zahid, a data engineering consultant with over 4 years of hands-on experience building scalable ETL pipelines on AWS and Databricks. I enjoy turning complex data into reliable, secure analytics platforms and collaborating with cross-functional teams to deliver cloud-native solutions. From migrating HDP to CDP on AWS to designing event-driven real-time data lakes and cross-region ingestion frameworks, I focus on governance, security, and cost optimization while delivering robust data platforms.

Available to hire

I’m Mahnoor Zahid, a data engineering consultant with over 4 years of hands-on experience building scalable ETL pipelines on AWS and Databricks. I enjoy turning complex data into reliable, secure analytics platforms and collaborating with cross-functional teams to deliver cloud-native solutions.

From migrating HDP to CDP on AWS to designing event-driven real-time data lakes and cross-region ingestion frameworks, I focus on governance, security, and cost optimization while delivering robust data platforms.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
See more

Work Experience

Consultant at Systems Ltd / Tec hvista Ltd
January 1, 2021 - Present
Led data engineering initiatives across Regeneron Pharmaceuticals and internal PoC projects. Migrated legacy HDP/HDF to CDP on AWS with cluster sizing, security baselines, and VPC policies; authored CloudFormation stacks to standardize IAM roles, S3 policies, KMS keys, and VPC components; mapped data lineage and validated Hive/Impala compatibility. Implemented cutover rehearsals and runbooks for operations and incident response; reduced migration risk and ensured data integrity.
Data Engineer at Regeneron Pharmaceuticals
February 1, 2023 - Present
Data Engineering (Feb 2023 – Present). Led HDP → CDP migration on AWS; designed configuration-driven routing for ingestion and target schemas; built idempotent, exactly-once processing with DLQ handling; parallelized PySpark jobs; instrumented CloudWatch and incident routing; achieved ingestion latency reduction and >99% job reliability. Implemented event-driven real-time data lake ingestion using MWAA, EventBridge, Lambda, and SQS; developed EU CRM Spark-based ETL for cross-region CRM data; established governance and security baselines.
Data Platform Operations at Regeneron Pharmaceuticals
January 1, 2022 - January 31, 2023
Data Platform Infrastructure Operations. Managed day-to-day data operations for HDP, EMR, and Databricks during Hortonworks to AWS-managed services migration; oversaw governance, cost-reduction initiatives; provided Redshift, RDS, and Hive DBA support; automated reporting via Python.
Consultant at System Limited
September 1, 2021 - December 31, 2021
Internal POC: end-to-end ML pipeline prototype. Stack included Azure, MLFlow, Python, MS SQL, Docker, Kubernetes, Jenkins; designed drift management, model retraining, and performance monitoring; demonstrated end-to-end ML workflow.

Education

Bachelor’s in Software Engineering at University of the Punjab
January 11, 2030 - January 7, 2026

Qualifications

AWS Certified Data Engineer
January 11, 2030 - January 7, 2026
AWS Certified Solutions Architect – Associate
January 11, 2030 - January 7, 2026
Azure Fundamentals
January 11, 2030 - January 7, 2026

Industry Experience

Computers & Electronics, Software & Internet, Professional Services