Hi, I’m Mankirat Bhamra, a Data Engineer focused on building robust, scalable data pipelines and cloud-based architectures. I design, develop, and optimize end-to-end data workflows to enable reliable analytics across large enterprises.
I excel at ETL with Python, SQL, and Apache Airflow, and I’ve led migrations from on-premises systems to AWS and Azure. I’m passionate about performance tuning, cost optimization, security, and using technologies like Hadoop, Spark, Hive, and Kafka for real-time processing and insightful analytics. I enjoy collaborating with cross-functional teams to deliver data models that empower business decision-making.
Skills
Language
English
Fluent
Work Experience
Data Engineer at DXC Technology
January 1, 2024 - November 4, 2025
Designed and implemented end-to-end ETL pipelines using Python, SQL, and Apache Airflow to process high-volume enterprise data into cloud-based data warehouses. Migrated legacy on-premises data systems to AWS S3 and Redshift, implementing IAM and CloudFormation for secure, automated infrastructure provisioning. Optimized data lake architecture on AWS Glue and Athena, improving query performance by 30% and reducing storage costs through partitioning and compression strategies. Integrated streaming data ingestion using Apache Kafka and Lambda for near real-time analytics dashboards in Power BI and Tableau. Developed data quality frameworks with PySpark and Pandas, automating validation checks and anomaly detection across diverse data sources. Partnered with cross-functional teams to enable business intelligence and advanced analytics, ensuring scalable and reliable data models that support predictive insights.
Data Engineer at Infinite Infolab
July 1, 2022 - July 1, 2022
Built scalable big data pipelines using Hadoop, Hive, Pig, and Spark to handle multi-terabyte data processing and analytics. Designed and managed relational and NoSQL databases (MySQL, Cassandra, MongoDB) for high availability and optimized query performance. Automated data ingestion and transformation workflows using Apache Airflow and SSIS, enabling faster data delivery to downstream applications. Created interactive dashboards and advanced reports in Tableau, Power BI, and Looker, driving actionable insights for clients across multiple industries. Deployed machine learning feature pipelines with Pandas, Scikit-learn, and TensorFlow to support predictive models and improve decision-making. Implemented CI/CD pipelines using GitLab, Jenkins, and Docker, enabling rapid, reliable deployment of data engineering solutions. Enhanced data governance and compliance by establishing metadata management, lineage tracking, and security policies across data systems.
Education
Master of Science in Business Analytics at The University of Texas at Dallas
January 11, 2030 - May 1, 2024
Bachelor of Engineering - BE, Electrical and Electronics Engineering at Dwarkadas J. Sanghvi College of Engineering
January 11, 2030 - May 1, 2022
Qualifications
Microsoft Azure Fundamentals - AZ-900
January 11, 2030 - November 4, 2025
AWS Academy Graduate
January 11, 2030 - November 4, 2025
Microsoft Power BI Data Analyst Associate
January 11, 2030 - November 4, 2025
Tableau eLearning certified Data Analyst
January 11, 2030 - November 4, 2025
Google Analytics 4 certified
January 11, 2030 - November 4, 2025
Campaign Manager 360 certified
January 11, 2030 - November 4, 2025
Alteryx Designer Core
January 11, 2030 - November 4, 2025
Industry Experience
Software & Internet, Professional Services, Computers & Electronics