I am a Senior Data Engineer with 8+ years of experience designing and delivering scalable, cloud-based data platforms across enterprise environments. I specialize in building and optimizing ETL and ELT pipelines using Python, SQL, and Apache Spark for large-scale data processing, with hands-on experience across Azure data services, including Databricks, Azure Synapse, and Snowflake, to support analytics and data engineering workloads. I focus on enabling analytics and ML use cases through reliable, production-grade data pipelines, with a strong emphasis on data quality, governance, and observability. I’ve worked extensively in healthcare data processing, HIPAA-compliant workflows, HL7 and FHIR standards, and I collaborate with product, analytics, and compliance teams to deliver trusted, analytics-ready data solutions that empower data-driven decision making.

Imran Hussain

I am a Senior Data Engineer with 8+ years of experience designing and delivering scalable, cloud-based data platforms across enterprise environments. I specialize in building and optimizing ETL and ELT pipelines using Python, SQL, and Apache Spark for large-scale data processing, with hands-on experience across Azure data services, including Databricks, Azure Synapse, and Snowflake, to support analytics and data engineering workloads. I focus on enabling analytics and ML use cases through reliable, production-grade data pipelines, with a strong emphasis on data quality, governance, and observability. I’ve worked extensively in healthcare data processing, HIPAA-compliant workflows, HL7 and FHIR standards, and I collaborate with product, analytics, and compliance teams to deliver trusted, analytics-ready data solutions that empower data-driven decision making.

Available to hire

I am a Senior Data Engineer with 8+ years of experience designing and delivering scalable, cloud-based data platforms across enterprise environments. I specialize in building and optimizing ETL and ELT pipelines using Python, SQL, and Apache Spark for large-scale data processing, with hands-on experience across Azure data services, including Databricks, Azure Synapse, and Snowflake, to support analytics and data engineering workloads.

I focus on enabling analytics and ML use cases through reliable, production-grade data pipelines, with a strong emphasis on data quality, governance, and observability. I’ve worked extensively in healthcare data processing, HIPAA-compliant workflows, HL7 and FHIR standards, and I collaborate with product, analytics, and compliance teams to deliver trusted, analytics-ready data solutions that empower data-driven decision making.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert

Language

English
Fluent

Work Experience

Senior Data Engineer at Gainwell
August 1, 2022 - Present
Designed Spark-based pipelines in Azure Synapse to process HL7/FHIR data, improving data readiness for ML inference and clinical deployment by 40%. Led Snowflake-native ingestion and orchestration with SnowPipe, Streams, and Tasks for near-real-time processing and automated change data capture, while enforcing RBAC, secure views, and data security controls. Built reusable ingestion and transformation frameworks with standardized logging and schema enforcement; provided architectural leadership for GenAI data pipelines, vector search, and retrieval systems. Implemented a dbt-based semantic layer and governance contracts to support analytics and AI workloads; automated model retraining and deployment workflows with CI/CD. Drove enterprise data governance and compliance controls (GDPR, CCPA, GLBA-aligned access, auditing) and established observability and incident response practices.
Big Data Engineer at Zoetis
November 1, 2019 - July 1, 2022
Built Spark-based data ingestion pipelines in Azure Synapse; maintained Feast and PGVector feature stores for AI initiatives across lines of business. Developed CI/CD pipelines to automate model packaging, validation, and deployment; implemented SQL-based anomaly detection to monitor data drift and maintain model performance. Published analytics-ready datasets for BI tools (Tableau/Power BI) and supported self-service analytics; guided GenAI data strategy and AI-ready data modeling, enabling scalable analytics and ML workflows.
Data Engineer at Sutherland
September 1, 2017 - August 1, 2019
Built scalable Spark jobs in Python to ingest payroll and benefits data into ML pipelines for fraud and revenue predictions. Managed feature stores using Feast and PGVector, and engineered end-to-end data enrichment with Airflow integrations and GitHub Actions for deployments. Established observability with Datadog, implemented drift detection and data quality checks, and mentored junior engineers while driving GenAI data strategy and CI/CD best practices.

Education

Add your educational history here.

Qualifications

Bachelor's in Computer Science
January 11, 2030 - January 29, 2026

Industry Experience

Healthcare, Life Sciences, Financial Services, Professional Services, Software & Internet