Nilay Borsadiya

Available to hire

Hi, I’m Nilay Borsadiya. I’m a data engineer with over 4 years of hands-on experience building robust, scalable data infrastructure across healthcare and logistics. I design and productionize ETL pipelines using PySpark, Python, and cloud platforms to enable timely, reliable insights while prioritizing data quality and governance.

I enjoy turning complex data into actionable insights, partnering with cross-functional teams to deliver dashboards, data lakes, and governance that protect sensitive data and accelerate decision-making.

Experience Level

Expert (12 skills)
Intermediate (3 skills)

Language

English
Fluent

Work Experience

Data Engineer at McKesson
January 1, 2024 - November 17, 2025
Developed and maintained scalable, production-grade ETL pipelines using PySpark and AWS Glue to ingest, transform, and load over 10TB of structured data. Implemented automated reporting with Power BI and Excel VBA, reducing reporting time by 20% and manual errors by 25%. Designed an end-to-end data lake on Amazon S3 integrated with Redshift Spectrum and AWS Athena, accelerating time to insights by 40%. Established automated data quality frameworks using Great Expectations within ETL workflows, improving data accuracy and regulatory compliance for healthcare data. Built modular data ingestion components with Python, AWS Lambda, and SQS to onboard data sources quickly, boosting onboarding speed by 60%. Orchestrated batch and streaming workflows with Apache Airflow and AWS Step Functions, ensuring data availability with strict SLAs and real-time monitoring.
Data Engineer at Metasystems
January 1, 2020 - February 1, 2022
Designed and developed batch and real-time data pipelines for high-volume logistics data, using Apache Kafka for streaming, Apache Spark for distributed processing, and SQL for transformation. Led the modernization of on-prem ETL processes by migrating them to cloud-native architectures on Azure Data Factory and Azure Synapse Analytics, improving pipeline execution speed by 45%. Modeled enterprise-wide dimensional schemas in Snowflake, using star and snowflake models, to support BI reporting across logistics and supply chain operations. Developed reusable Python scripts to standardize structured and unstructured data formats, enforcing governance compliance and improving downstream analytics. Collaborated with DevOps to implement real-time monitoring and alerting using AWS CloudWatch, Azure Monitor, and Prometheus, minimizing downtime. Used Pandas, NumPy, and Scikit-learn to preprocess and analyze large datasets and extract insights that shaped data strategies.

Education

Post-Graduation in Artificial Intelligence and Machine Learning at Lambton College, Canada
Bachelor of Technology, Information Technology at DDU, India

Qualifications

Python and SQL for Data Analysis
Data Visualization using Microsoft Power BI and Tableau

Industry Experience

Healthcare, Transportation & Logistics, Software & Internet, Professional Services, Education, Manufacturing