Sushil Bhandari

Available to hire

I am a data engineer with over 6 years of experience building secure, scalable, high-performance data platforms. My expertise spans the healthcare, insurance, and analytics domains, where I design real-time and batch pipelines using PySpark, Kafka, and dbt across AWS, Azure, and GCP. Drawing on modern data warehousing and workflow orchestration, I build HIPAA-compliant pipelines that strengthen governance and support reverse ETL operations.

I am an AWS Certified Data Engineer Associate with a track record of delivering production-grade solutions that reduce costs, optimize performance, and support data-informed decision-making at scale. I enjoy collaborating with cross-functional teams on feature engineering pipelines that power AI-driven applications, and I contribute to Agile processes and documentation to foster knowledge sharing.

Experience Level

Expert

Language

English
Fluent
Japanese
Intermediate
Nepali
Intermediate
Hindi
Intermediate

Work Experience

Data Engineer / Data Scientist at Spring Health, New York, NY
December 1, 2022 - Present
Designed and optimized ETL/ELT pipelines using Python, AWS Glue, Lambda, Step Functions, and Snowflake for healthcare datasets to enable AI/ML applications. Built real-time streaming pipelines with Kafka and Airflow, orchestrated automation of AI-ready datasets, and collaborated to develop feature engineering pipelines that accelerated model development. Implemented ML pipelines on Spark and Databricks, and leveraged generative AI for clinical text summarization and patient insights. Automated CI/CD deployments with Terraform, Docker, and Kubernetes, ensuring HIPAA and GDPR compliance and improving AI model accuracy and deployment efficiency.
Data Engineer at Wells Fargo, Washington, DC
September 30, 2022 - August 27, 2025
Led the migration of data infrastructure to Azure and Google Cloud, resulting in a 20% cost reduction. Developed and optimized ETL pipelines using Apache Spark, Azure Data Factory, GCP Dataflow, and Kafka to improve processing times and pipeline reliability. Built data warehousing solutions that sped up reporting and enabled real-time insights. Implemented security compliance with encryption and IAM roles, developed CI/CD pipelines, and integrated ML models for fraud detection and financial forecasting, improving model accuracy by 30%. Mentored junior engineers and improved team productivity.
Data Engineer at Texas Mutual Insurance Company, Austin, TX
January 31, 2020 - August 27, 2025
Designed and implemented ETL workflows, optimizing claims and underwriting data processing by 30%. Developed an Amazon Redshift data warehouse for real-time reporting and improved data retrieval by 25%. Automated data pipelines integrating legacy systems with cloud platforms, reducing integration time by 35%. Created Tableau dashboards for claims and risk management teams. Ensured data security and compliance (HIPAA, GDPR) with role-based access controls and encryption. Led a cloud migration that enhanced scalability and reduced costs, and mentored junior engineers, increasing team productivity.
Data Engineer at Spring Health, New York, NY
December 1, 2022 - Present
Built and maintained ETL pipelines using Python, AWS Glue, Lambda, Step Functions, and Snowflake to streamline data ingestion, transformation, and integration for behavioral health data. Designed near real-time pipelines with Apache Airflow, AWS Step Functions, and Kafka, managing streaming and batch workloads. Developed and optimized data models in SQL, dbt, and Snowflake, reducing query complexity by 50% and improving reporting speed. Managed hybrid cloud storage across AWS S3, Lake Formation, and Google Cloud Storage. Improved Amazon Redshift warehouse query performance by 35%. Automated data quality checks using Great Expectations and AWS Glue DataBrew, increasing accuracy by 40%. Established fault-tolerant data lakes using Parquet, Avro, JSON, and Delta Lake formats. Implemented robust governance and security measures for HIPAA and GDPR compliance. Automated CI/CD deployments, cutting release cycles from 3 days to 3 hours. Boosted batch and streaming job performance by 60% through Apac
Data Engineer at Wells Fargo, Washington, DC
September 30, 2022 - September 4, 2025
Designed and built ETL pipelines using Azure Data Factory, Databricks (PySpark/Scala), and Synapse Analytics, reducing runtimes by 35%. Developed financial data marts and warehouses in Snowflake, Synapse, and SQL Server to speed up regulatory and risk reporting. Engineered real-time ingestion pipelines with Kafka and Databricks Structured Streaming, processing millions of financial transactions daily. Implemented data governance and lineage using Azure Purview, Collibra, dbt tests, and Great Expectations, enhancing audit readiness. Delivered reverse ETL workflows syncing curated data to Salesforce and ServiceNow, improving decision-making. Secured sensitive data with Azure Key Vault, IAM RBAC, and encryption, meeting SOX, PCI-DSS, and FINRA compliance requirements. Automated CI/CD pipelines with Azure DevOps, Jenkins, Terraform, and Kubernetes, shortening release cycles. Orchestrated workflows with Apache Airflow and Azure Data Factory, and integrated monitoring with Datadog, Elastic, and PagerDuty. Tuned large queries to
Data Engineer at Texas Mutual Insurance Company, Austin, TX
January 31, 2020 - September 4, 2025
Built scalable ETL pipelines using AWS Glue (PySpark), Redshift, and Snowflake, improving claims and underwriting data processing by 30%. Developed actuarial and risk data marts in Snowflake and Redshift Spectrum, reducing report latency by 25%. Automated data ingestion from Oracle ERP, SAP, and policy systems using Talend, AWS DMS, and custom frameworks, decreasing manual effort. Designed data lakes on AWS S3 with Parquet and ORC for structured and semi-structured data access. Modeled data with dbt, SQL, and Hive using star and vault schemas to support actuarial analytics and fraud detection. Applied data quality practices with Great Expectations and AWS Glue DataBrew, increasing data reliability. Built dashboards in Tableau and Power BI presenting KPIs on claims and renewals. Enforced HIPAA and GDPR compliance with AWS IAM, KMS encryption, and RBAC to secure PHI and PII. Automated provisioning with Terraform and CloudFormation, reducing setup time by 40%. Partnered with actuaries and data scientis

Education

Master’s in Information Technology Management at Webster University, San Antonio, TX
January 11, 2030 - August 27, 2025

Qualifications

AWS Certified Data Engineer - Associate
January 11, 2030 - August 27, 2025
Databricks Certified Data Engineer Associate
January 11, 2030 - August 27, 2025
SnowPro Advanced: Data Engineer
January 11, 2030 - August 27, 2025

Industry Experience

Healthcare, Financial Services, Software & Internet, Life Sciences, Professional Services