I am Hamza Hanif Alam, a data engineer focused on building scalable data pipelines, deriving insights from complex datasets, and delivering data-driven solutions in banking, telecom, and cloud environments. I enjoy solving data quality challenges and turning raw data into reliable dashboards, ML-ready features, and actionable business decisions. My experience spans ETL development, real-time processing, and cross-functional collaboration to align data work with business needs. I thrive in fast-paced environments and love using Python, Spark, Kafka, and BI tools to empower teams with timely insights and robust data products. I’m always eager to learn new technologies, optimize data workflows, and contribute to impactful projects across analytics, engineering, and AI-enabled solutions.

Hamza Hanif Alam

I am Hamza Hanif Alam, a data engineer focused on building scalable data pipelines, deriving insights from complex datasets, and delivering data-driven solutions in banking, telecom, and cloud environments. I enjoy solving data quality challenges and turning raw data into reliable dashboards, ML-ready features, and actionable business decisions. My experience spans ETL development, real-time processing, and cross-functional collaboration to align data work with business needs. I thrive in fast-paced environments and love using Python, Spark, Kafka, and BI tools to empower teams with timely insights and robust data products. I’m always eager to learn new technologies, optimize data workflows, and contribute to impactful projects across analytics, engineering, and AI-enabled solutions.

Available to hire

I am Hamza Hanif Alam, a data engineer focused on building scalable data pipelines, deriving insights from complex datasets, and delivering data-driven solutions in banking, telecom, and cloud environments. I enjoy solving data quality challenges and turning raw data into reliable dashboards, ML-ready features, and actionable business decisions. My experience spans ETL development, real-time processing, and cross-functional collaboration to align data work with business needs.

I thrive in fast-paced environments and love using Python, Spark, Kafka, and BI tools to empower teams with timely insights and robust data products. I’m always eager to learn new technologies, optimize data workflows, and contribute to impactful projects across analytics, engineering, and AI-enabled solutions.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Intermediate

Work Experience

Data Engineer at Addo AI
January 1, 2025 - November 1, 2025
Developed GL-level MTD/YTD averages for adjustment and non-adjustment period-end balances for United Bank Limited (UBL). Created and validated SQL logic in Teradata during pre-production, later productionized in Informatica Developer. Extracted data from the Oracle Core Banking System (CBS), staged it in Cloudera Hive, and loaded curated datasets into Teradata. Collaborated with cross-functional teams to ensure pipeline reliability and data quality aligned with business requirements.
Data Engineer at Inseyab Consulting
January 1, 2025 - January 1, 2025
Utilized Kafka, Spark, and Docker to develop scalable and real-time data pipelines. Designed and maintained SQL Server databases for reliable data storage. Authored complex SQL queries for seamless data transactions and reporting. Managed deployments on on-premises and Azure cloud VMs, ensuring high availability and performance. Oversaw GitHub project lifecycle with a focus on version control and collaboration. Developed an attendance tracking system for TDRA enabling monitoring of employee anomalies, built Power BI dashboards, implemented Alteryx workflows, and explored ML techniques for enhanced data analysis and AI chatbot integration.
Data Engineer at Xloop Digital Services
March 1, 2024 - March 1, 2024
Designed and implemented an end-to-end ETL pipeline using MinIO for cloud data extraction, REST Proxy for IoT ingestion, Kafka for messaging, PySpark for transformation, and MongoDB for storage. Containerized microservices with Docker for portable deployment. Managed GitHub project lifecycle, ensuring version control and collaboration. Implemented real-time processing with AP Scheduler, achieving rapid ML outputs for occupancy estimation in under 10 seconds.

Education

BE, Electrical Engineering at NED University, Karachi
January 11, 2030 - January 1, 2022

Qualifications

Data Engineering Bootcamp
January 11, 2030 - November 1, 2025
Python for Data Science, AI and Development
January 11, 2030 - November 1, 2025
Introduction to Big Data with Spark and Hadoop
January 11, 2030 - November 1, 2025
Introduction to NoSQL Databases
January 11, 2030 - November 1, 2025
ETL Pipelines with Shell, Airflow and Kafka
January 11, 2030 - November 1, 2025
Databases and SQL for Data Science with Python
January 11, 2030 - November 1, 2025
Certified Alteryx Core Designer
January 11, 2030 - November 1, 2025

Industry Experience

Software & Internet, Professional Services, Telecommunications, Financial Services

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Intermediate